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I, Audrey D. Goddard, Ph.D. do hereby declare and say as follows: 

1 . I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2. Between 1 993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 
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4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (i.e., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et al % Biotechnology 10:413-417 (1992) (Exhibit B); Livak et a!. y PCR 
Methods Appl. 4:357-362 (1995) (Exhibit C) and Heid et aL, Genome Res. 6:986-994 (1996) 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et aL, Proc. 
Natl. Acad. Sci. USA 95(25): 14717-14722 (1998) (Exhibit E); Pitti et aL, Nature 
396(6712):699-703 (1998) (Exhibit F) and Bieche et a/. Jnt J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et aL have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et aL studied the genomic amplification of a decoy 
receptor for Fas iigand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et aL used the assay to study gene amplification in breast cancer. 
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7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control. It is further my considered scientific opinion that an at least 2- fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 



that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 



therapy. 



8. 



I declare further that all statements made herein of my own knowledge are true and 
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AUDREY D. GODDARD, Ph.D. 



Genentech, Inc. 110 Congo St. 

i DMA Way San Francisco, CA, 941 31 

South San Francisco, CA, 94080 415.841.0154 

650.225.6429 415.819.2247 (mobile) 

g0ddarda@9cne.com agor1ciard@pacbell.net 

PROFESSIONAL EXPERIENCE 

Genentech, Inc. 1 993-present 

South San Francisco, CA 

2001 - present Senior Clinical Scientist 
Experimental Medicine / BloOncology, Medical Affairs 

Responsibilities: 

» Companion diagnostic oncology products 

• Acquisition of clinical samples from Genentech's clinical trials for translations! rvseerch 

• Translation*! research using clinical specimen end data for drug development and 
diagnostics 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
PartHSubteam 

interests: 

« Ethical and lege! implications of experiments with clinical specimens and data 

• Appiicathn of pharmacogenomics in clinical trials 



1996-2001 Senior Scientist' 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities: . 

• Management of a laboratory of up to nineteen -including postdoctoral fellow, associate 
scientist, senior research associate and research assistants/associate levels 

• Management of a $750K budget 

• DNA sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery, - ESTs, cDNAs, and constructs 

• Genomic s&quence analysis and gene Identification 

• DNA sequence end primary protein analysis 

Research: 

» Cftfomosoma//ccaffcaf/on or*nove/ genes 

• Identification and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to Inflammatory dlseasas 

« Design and development of schemes for high throughput genomic DNA sequence analysis 
9 Candidate gene prediction and evaluation 
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1993-1998 Scientist 

Head of the DNA Sequencing Laboratory, Molecular Binlngy Department, Research 



Responsibilities 

• DNA sequencing core facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expended the group 
into fifteen, introducing a tevei of middle management and additional areas of research 

• Participated In the development of the basic plan hr high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene Identification. 

« Design end implementation of analysis tools required for high throughput gene identification. 

• Chromosomal localization of genes encoding novel secreted proteins. 

Research: 

• Genomic sequnnce scanning for new gene discovery, 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

t Growth hormone receptor gene SNPs in children with Idiopathic short stature 

Imperial Cancer Research Fund 1089-1992 
London, UK with Dr. Ellen Solomon 

6/69-12/92 Postdoctoral* Fellow 

• Cloning and characterizaition of the genes fused at the acute prnmyelocytic leukemia 
translocation breakpoints on chromosomes 17 and 15, 

• Prepared a successfully funded European Union multi-center grant application 

McMaster University 1983 
Hamilton, Ontario, Canada with Or, G. D, Sweeney 

5/83 - S/83; NSERC Summer Student 

• in vitro metabolism of (3-naphthoflavone in C57BI/6J and DBA mice 



EDUCATION 



Ph.D. 



University of Toronto 
Toronto, Ontario, Canada. 
Department of Medical 
Biophysics, 



"Phenotyplc and genratypi'c effects of mutations In 
the hum*n retinoblastoma gene." 
Supervisor; Dr. R. A, Phillips 



Honours B.Sc McMaster University, 

"The In vitro metabolism of the cytochrome P-448 Hamilton, Ontario, Canada, 

inducer 0-naphthoflavone In C57BL/6J mic*." Department of Biochemistry 
Supervisor: Or. G. D. Sweeney 



1983 
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ACADEMIC AWARDS 

Imperial Cancer Research Fund Postdoctoral Fellowship 1 989-1 992 

Medical Research Council Studentship 19R3-198B 

NSERC Undergraduate Summer Research Award 1 983 

Society of Chemical Industry! Merit Award (Hons. Biochem.) 1 983 

Dr. Harry Lyman Hooker Scholarship 1 981-1983 

J.L.W. Gill Scholarship 1981-1982 

Business and Professional Woman's Club Scholarship 1980-1 981 

Wyerhauser Foundation Scholarship 1979-1980 



INVITED PRESENTATIONS 

Genentech'e gene discovery pipeline: High throughput identification, cloning and 
characterisation nf novel gejnes. Functional Genomics: From Genome to Funrtion, Litchfield 
Park, AZ, USA. October 2000 

High throughput identification, cloning end characterization of novel genes, G2K:BacK lo 
Science, Advances In Genome Biology and Technology I. Marco Island, FL, USA. February 
2000 

Quality control in DNA Sequencing: The use nf Phned end Phrap. Bay Area Sequencing 
Users Meeting, Berkeley, CA, USA. April 1999 

High throughput secreted (irotein identification and cloning. Tenth Internntional Genome 
Sequencing and Analysis Conference, Miami, FL, USA. Septemher 1998 

The evolution of DNA sequencing: The Genentech perspective. Bay Area Sequencing Users 
Meeting, Berkeley. CA. USA. May 1998 

Partial Growth Hormone Insensrttvlty: The role nf GH-receptor mutations In idiopathic Short 
Stature. Tenth Annual National CooperativeGnowth Study Investigaiors Meeting, San 
Francisco, CA, USA. October, 1996 

Growth hormone (GH) receptor defects are present In selected children with non-GH-defldent 
short stature* A moiecularfbasis for partial GH-insensitivity. 76 th Annual Meeting of The 
Endocrine Society, Anaheim, CA, USA. June 1994 

A previously uncharacterized gene/myi, is fused to the retinoic acid receptor alpha gene in 
acute promyelocyte leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy. October 1991 
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Jun. 25. 2002. Lawrence DA. Levin© AJ, 
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hormone Insensitivity syndrome. Patent Numo* 

2001 ' rorinwskl PJ Gurney AL Hillan K end Williams PM. NucMe 
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tnsens.t.v.ty syndrome. Patent NumD . Treatme nt of partial growth hormone 

s» ~ » Ju * 1997 

Multiple additional provisional applications tiled 
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PUBLICATIONS ad Comparative sequence analyst of 

Adams SH, chui C. Schiloach SL. Yu XX. Ood Moced in Hngm brown 
adipose tissue. Cloning, wg* ■35.142 

to obe 3 ity. 36 °- 135 142 ' nT JS Goddard AD . Yansura OG. 

Lee J. Ho WH. Maruoka M. Cnrpuz ffTBj«w J jDT. ^nd forth. 0, 

17 receptor homnlog IL-17Krn. ju i wood wl QocWard AD and 

oa^pmen,. MM 40* ^ GoddMd fc 09 vos m , sac WO. M 

binding 10 two distinct receptors. Sne** 290. 623-627. 

Son, Paw m «■» «■ ^i^^^S^SS^ST 

AnplicalionofcDNAmicroarrays'n^^^ 

developmont, and response to miury. Cmiletmm. ™ .,,9991 

^ ^ a uuiknn £W Mullins MC and Rosenthal A. (1999) 
0„o S. Brush J. T e raoM H. Goddard ™ U h " d6r ,i„ r „ uireS BMP, FGF8, and 

Oevdopmon. of no.adjan.ro,. i""S^^f5: 55S-566. 
mo nomeodomain prote,n s ouU«e,Pnox2A. N.™ 

S,onoO. MU rono.M.L U *.S.YeW^ 

Hilton K, Oodd.rd A and Gurnet A. L (19 99) F GFW. 
unioue spedlidry lor FGFR4. CXotaie 11. 729-735. 
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Yan M, Lee J, Schilbach S, Goddard A and Dixit V. (1999) mE1(X b novel cacpase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL, Marsters SA, Huang RM. Pitt* RM, Mark DT, Baldwin DT, Gray AM, Dowd P, 
Brush J. Helden$ S, Schow P, Goddard AD, Wood Wl t Baker KP ( Godowski PJ and 
Ashkenazi A (1899) Identification of a new member of the tumor necrosis factor family and Us 
receptor, a human orthoiog of mouse GITR. Current Biology 9(4): 215-215, 

Ridgway JSB, Ng E, Kern JA ,Lee J ( Brush J, Goddard A and Carter P. (1999) Identification 
of a human anti-COSS single-chain Fv by *ubtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 2713-2723, 

Piul RM, Masters SA, Lawrence DA. Roy M t Kischkel FC f Dowd P. Huang A, Oonahue CJ f 
Sherwood SW, Baldwin DT, Godowski PJ, Wood Wl, Gurney AL, "Milan KJ, Cohen RL, 
Goddard AD, Botstein D and Ashkenazi A. (1995) Genomic amplification of a decoy receptor 
for Fas llgand In lung and colon cancer. Nature 396(6712): 699-703. 

Pennica D, Swansan TA. Walsh JW, Roy MA, Lawrence DA, Lea J, Brush J, Taneyhill LA t 
Deuel B, Lew M, Watanabe C, Cohen RL, Melhem MP, Finley GG, Quirke P, Goddard AD, 
Hiuan KJ, Gurney AL t Boisteln D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed celts and 
aberrantly expressed in human colon tumors. Proc, Natii. Acad Sci. USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A. Huang A, Xle MH, Zhang M. Goddard A ( Wood Wl, Gurney AL 
and Godowski PJ. (199S) Toll-like r&cepinr-2 mediates lipopolysaccharirie-induced cellular 
signalling, Nature 395(6699): 284-?fi8. 

Merchant AM. Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficient route to human bispedfic IgG. Nature Biotechnology 16(7): 677-861, 

Marsters SA, Sheridan JP, Pitti RM,- Brush J, Goddard A and Ashkenaai A. (199B) 
Identification of a ligand for the oearh-dnmain-containing receptor Apo3. Current Biology 8(9)' 
525*528, 

Xie J. Murone M f Luoh SM, Ryan A. Gu Q, Zhang C, Bonlfas JM, Lam CW. Hynee M, 
Goddard A, Rosenthal A, Epstein EH Jr. end de Sauvage FJ. (1998) Activating Smoothened 
muiations In sporadic basal-cell carcinoma. Nature, 391(6862): 90*92. 

Marsters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D t Yuan J, Gurney A, 
Goddard AD, Godowski P and Ashkenazi A, (1997) A nerval receptor for Apo2L/TRAIL 
contains a truncated death domain. Current Biohgy, 7(12): 1003-1006. 

Hynes M, Stone DM, Dowd M, Pitta-Meek 5, Goddard A, Gurney A and Rosenthal A (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor G/M . Neuron 
19: 15-26. - 

Sheridan JP, Marsters SA, Pitti RM, Gurney A., Skubatch M, Baldwin D t Ramakrlshnan l, 
Gray CL, Baker K, Wood Wl, Goddard AD. Godowski P, and Ashkenazi A. (1997) Control of 
TRAIL-Induced Apoptosls hy a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821. 
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Goddard AO, Dowd P. Chernaueek S t GeffnarM. Gertner J, Hint* R, Hopwood N, Kaplan S, 
Plotnick L, Rogol A, Roaenfield R. Saenqer P. Mauras N, Hershkopf R, Anguio M and Atfie, K, 
(1997) Partial growth hormone insensitivity: The role of growth hormone receptor mutations in 
idiopathic short stature. J. Pediatr. 131: $51-55. 

Klein RD, Sherman D, Ho WH, Stone D, Bennett GL Moffat B ( Vandlen R, Simmons L, Gu Q, 
Hongo JA. Devaux B ( Poulsen K, Armaninl M, Noraki C, Asai N, Goddard A, Phillips H, 
Henderson CE, Takahashi M and Rosenthal A. (1937) A GPWinkwj protein that Interacts with 
Ret to form a candidate neurturin receptor. Nature. 387(6834): 717-21 . 

Stone DM, Hynes M, Armanini M, Swansnn TA. Gu Q, Johnson RL. Scott MP, Pennica 0, 
Goddard A, Phillips H, Noli M, Hooper JE, de Sauvage F and Rosenthal A. (1996) The 
tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog. Nature 
384(6605): 129-34. 

Msrsters SA. Sheridan JP, Donahue CJ, Pitt! RM, Gray CL, Goddard AD, Bauer KD and 
Ashkenazi A. (1996) Apo3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and activates apoptosis and NF-kappa 0. Current Biology 6(12): 
1669-76. ' * 

Rothe M, Xiono J, Shu HB, Williamson K. Goddard A and Goedriei DV. (1906) l-TRAF is a 
novet TRAF-interacting protein that regulates f RAF-mediated signal transduction. Proc. Netl. 
Acad. Sci 1/SA93: 6241-6246. 

Yang M. Luoh SM, Goddard A. Reiliy D, Henzel W and Bass S. (1996) The bglX gene 
located at 47,8 min on the Escherichia coll chromosome encodes a periplasms heta- 
gluC06idase, Microbiology 142: 1659-65. 

Goddard AO and Black DM. (1 996) Familial Cancer In Molecular En d ocrinology of Cancer. 
Waxman, J. Ed. Cambridge University Press, Cambridge UK, pp.1 &7-215. 

Treanor JJS, Goodman I, de Sauvage F, Stone DM. Poulson KT, Beck CD, Gray C. Armanini 
MP, Pollocks RA, Hefti F, Phillips HS, Goddard A, Moore MW, BuJ-Bello A, Davis AM, Asai N, 
Takahashi M. Vandlen R, Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GDNF. Nature 382* B0-83. 

Klain RO, Gu 0, Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteins and receptors. Proc. Natl. Acad. Sci. USA 93: 7108-7113. 

Winslow JW, Moran P, Valverde J, Shlh A, Yuan JQ, Wong SC, Tsai SP. Goddard A, Hansel 
WJ, Hefti F and Caras I. (1995) Clnning of AL-1, a Hgand for an Eph-related tyrosine kinase 
reneptor involved In axon bundle formation." Neuron 14: 973-981. 

8ennett BD, Zeigler-FC, Gu G, Fendiy B, Goddard AD, Glllett N and Matthews W. (1995) 
Molecular cloning of a llgand for the EPH-ralated receptor protein-tyroslne kinase Htk. Proc. 
NatK Acad. Set. USA 92: 1866-1870, 

Huang X, Yuang J, Goddard A. Foulis A, James RF, Lernmark A. Pujol-Borrell R, 
Rabinovitch A ( Somrua N and Stewart TA. (1995) interferon expression in the pancreases of 
patients with type I diabetes. Diabetes 44. 658-664, 

Goddard AO, Yuan JO, Fairbaim L, Dexter M, Barrow J, Kozak C and Solomon E. (19S5) 
Clonlno of the murine homolog of the leukemia-associated PML gene. Mammalian Genome 
d:732->37. 
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Goddard AD, Covella R, Luoh SM, Clackson T, Attie KM, Gesundhflit N. Bundle AC, Wells 
JA, Carlsson LMTI and The Growth Hormone Inseneitlvlty Study Group. (1985) Mutations of 
the growth hormone receptor in children with Idiopathic short stature. N. EngL 1 Med. 333: 
1 093-1 09S. 

Kuo 3S, Moran P ( Grlpp J f Armanini M, Phillips HS, Goddard A and Caras IW, (1994) 
Identification and characterization of Back, a predominantly brain-specific: non-receptor protein 
tyrosine kinase related to Csk. J. Neurosd. Res. 38: 705-715, 

Mark MR, Scadden DT, Wang Z. Gu Qi, Goddard A and Godowski PJ. (1994) Rse, a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels In the 
brain. Journal of Biological Chemistry 269: 10720-10728, 

Borrow J, Shipley J, Howe K, Klely F, Goddard A, Sheer D, Srlvastava A r Antony AC, 
Roretos T, Mitelman F and Solomon E, (1994) Molecular analysis of simple variant 
translocations in ecute promyelocyte leukemia. Genes Chromosomes Cancer 9 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet. 21: 321-376. 

Borrow J, Goddard AD, Gibbons 6, Kar? F. Swirsky O, Fioretos T, Oube I, Winfield DA, 
Kingston J. Hegemeijer A, Rees JKH. Lister AT and Solomon E. (1992) Diagnosis of acute 
promyelocyte leukemia by RT-PCR: Detection of PML-RARA and RARA-PML fusion 
transcripts. Br. 1 Haematol. 82: 529-540. 

Goddard AO, Borrow J and Salomon E. (1392) A previously unchsracterlzed gene, PML, is 
fused to the rettnolc add receptor alpha gene In acute promyeiocytic leukemia. Leukemia 6 
Suppl3: 11 75-1 193. 

Zhu X, Dunn JM, Goddard AD, Squire JA, Becker A, Phillips RA and GaIHe BL. (1992) 
Mechanisms of I055 of heterozygosity in retinoblastoma. CytogenQl Cell. Genet 59; 248-252. 

Foulke$ W, Goddard A. and Patel K. (1991) Retinoblastoma linked with Seascale [tetter]. 
British Med, J. 302" 409. 

Goddard AD, Borrow J, Freemnnt PS and Solomon E. (1991) Characterization of 3 novel zinc 
finger gene disrupted by the t(l5;17) in acute pramyelncytic leukemia. Science 254: 1371- 
1374. 

Solomon E, Borrow J and Goddard AD. (1991) Chromosomal aberrations in cancer. Science 
254: 1153-1160. 

Pajunen L, Jones TA, Goddard A, Sheer D, Solomon E. Pihlajaniemi T and KMrikko Ki. . 
(1991) Regional assignment of the human gene coding for a multifunctional peptide (P4HB) 
acfing as the (3-subunlt of proiyl-4-hydroxylase and the enzyme protein disulfide isomerase to 
17q25. CytogenQt Cell Genet 56: 165-163. 

Borrow J, Black DM, Goddard AD, Yagle MK. Fri6Chauf A,-M and Solomon E. (1991) 
Construction and regional localization of a Not\ Unking library from human chromosome 17q. 
Genomics 10: 477-4B0, 

Borrow J, Goddard AD, She*r D and Solomon E. (1990) Molecular analysis of acute 
promyeiocytic leukemia breakpoint cluster region on chromosome 17. Science 249: 1577- 
1560. 
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Myers JC, Jones TA, Pohjoiainen E-R, Kadri AS, Goddard AD, Sheer D, Solomon E and 
Pihiajaniemi T. (1990) Molecular cloning of 5{IV) collagen and assignment of the gena to the 
region of the region of the X-chromosome containing the Alport Syndrome locus. Am. 1 Hum. 
Genet, 46: 1024-1033. 

Gallic BL t Squire JA, Goddard A, Dunn JM, Canton M, Hlnton D, Zhu X and Phillips RA. 
(1990) Mechanisms of oncogenesis In retinoblastoma, Lafi. invest. 62: 394-406. 

Goddard AD, Phillips RA Grcger V, Passarge E Hopping W, Gallie 8L and Horsthemke 6. 
(1090) Use of thR RB1 cDNA as a diagnostic probe In retinoblastoma families. C1in!cdf 
Genetics 37: 117-126. 

Zhu XP, Dunn JM, Phillips RA, Goddard AD, Paton K£, Becker A and Gallia BL. (1999) 
Germltne. hut not somatic, mutations of the KB1 gene preferentially involve the paternal 
allele. Nature 340: 312-314. 

Gallie BL Dunn JM, Goddard A, Becker A and Phillips RA. (19B8) Identification cf mutations 
In the putative retinoblastoma oene. In M olecular Binioov of The Eve: Genes. Vision a nd 
Ocular Disease . UCLA Symposia on Molecular and Cellular Biology, New Series, Volume 68, 
j, Piatigorsky, T. Shinohara and P.S. Zeienka, Eds. Alan R. Li*s, inc.. Now York, 1986, pp, 
427-43$. 

Goddard AD, Baiakier H, Canton M ; Dunn J, Squire J, Reyes E. Becker A, Phillips RA and 
Gallie BL (1938) Infrequent genomic rearrangement and normal expression of the putative 
RB1 gene in retinoblastoma tumors. Moi CelL BioL 8: 2032-2083. 

Squire J, Dunn J, Goddard A. Hoffman T, Musarella M, Willard HF, Becker AJ, Gailie BL and 
Phillips RA. (1988) Cloning of the esterase O gene: A polymorphic gene probe closely linked 
to the retinoblastoma locus on chromosome 13. Proa Natt. Acad. So*. USA 83: 6573-6577. 

Squire J, Goddard AD, Canton M, Becker A, Phillips RA and Gallie BL (1986) Tumour 
induction by the retinoblastoma mutation is independent of H-myc expression. Nature 322: 
555-557. 

Goddard AD, Huddle JA, Gallie BL and Phillips RA. (1985) Radiation sensitivity of fibroblasts 
of bilateral retinoblastoma patients as determined by mlcronucleus induction in vitro. Mutation 
Rssearch 152: 31-38. 
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liMULTiUIEOUS AMPUnUTION AND DETECTION OF 
[SPECIFIC DMA SEQUENCES 

Lusssll Hignchi*, Gavin Bollinger 1 , P. Scan Walsh and Robert Crifflth 

— v-.i n^.^ mr i*nn ««4 St.. PWrvvOIe. CA 9*006. 'Chiron Corpo radon. 1W" MnJ 3& ( £c 



nc Hclecular S^ms. fflc. 1400 32rf Su, Emeryville. CA 9*506. 'Chiron Corporadon. ] 
^COS. ,t CoTrttpor.<Utig author. 
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jWe have enhanced the polymerase chain 
reaction (PCR) such that specific I>NA 
*sequ€sic«fi can he detected -without opp.n- 
the reaction tube. This enhancement 
..juirca the addition of cthidium bromide 
f (EtBr) to a PCR- Since the fluorescence af 
itBr increases in the presence of double- 
si mnded (ds) DNA an increase in flvores? 
Iceaoe in such a PGR indicates a positive 
|jiapH£cation, which can be easily moni* 
fibred externally. In fact, amplification can 
contmumxsiy monitored In order to 
pbllow its progress. The ability to similiter 
|ieously amplify specific DNA sequences 
'^ad oatRct the product of the amplification 
both jimpliflcc and improves PCR and 
may facilitate its automation and more: 
widespread use in the clinic or in other 
"situations requiring high sample through- 
put. 

Ithougk (h* powaua) benefits of PGR 1 to clin- 
ical cijapuwrjcs are well known**, U is stall noi 
widely used in thil aetting, even though it ia 
four years ai»c« thermostable DNA polymer- 
ase PCR practical Some of the reasons for io slov 
jranrr arc high coat, Uck of auiomarion of pre- and 
?CR processing seeps, and false positive results from 
Sfryover-cofiuminattoa. The firtc r»o points arc related 
«Jtfta: labor is the largest contributor^ cost at che present 
o? PCK development Mort current assays require 
<^<5e form of ''dowuarrcani" processing once diermocy- 
Is dome in order w d«t©nmaft vnertier the caxget 
pNA sequence was present and h«s xmplifed. These 
(include UMA hybridbuju&n*- 0 , eel «karophcresis with or 
[jfthout us* of r«TTicckm dieesnon 7,5 , HPLC*. or capillary 
^tCQephoresh 10 . These mcihodfl ar« Ubor.iaeeme. H^ve 
"> throughput, and arc di&suk co Ruiosnate. "Ite tlaird 
Jm U also closely related to deviwtream prooe<«lng. 
jftc hindliiag or it* PCR product in th«e« downsorfiafla 
^)c«tc« incrKwir* the chances that amplified DNA via 
"p/nd through the typing lab, reaulrin^ in » risi at 




"carryover'* foJn* poticives in subsequ«:flc t<0CJig u « 

These dowewwaaa ptacesslng Keeps would b« elimi- 
nated if specific ajepfification and dorction of amplified 
Wa. mnk pkee simultaneously vftchin fta unopened re- 
action vessel. Amy* in which eueh differenc proceaaea take 
place without the need 10 sepanic reaction component* 
have been termed ^omo^enoous". No Truly homoge- 
neous PCX assay has eccn domcnttnied eti a^ic althoagh 
prosiT«*i towards this end baa been reported. Chebnb, et 
ol. 1 *, developed a ?CR product detecnon «hetne using 
Huorescene primen thai resulted in & fluore«e*nc PCR 
product. ALel*-«pecinc primers, each with diff«r«nr fluo- 
roceoe tagf, wore used to indicate die js ertOC TF < » ^ ^ c 
DKA. Howover, Uia tmincorporated ptimcrs tnu« still be 
removed iaa dowaccreain process in order to viiualbe the 
result Rcccntlr, Holland, et al, 13 . developed an wsay in 
<whjch the eado£«nou« 5' exonvidcase assay of T<uj DNA 
polymerase exploited to cleave 3. labeled oligonucleo- 
tide probe The probe would only cieavr if PCR amplifi- 
cation had produced its eomplemetiiary 5eo,ucn<:c. In 
order to detect th*i deavage products, however, a «uba«- 
quenr procoa U again no«»dod. 

We have developed » truly homogeneous assay for PGU 
anil PCR product detection h»s«d upon the grcady in- 
creased fluorescence ihai otbidiua. hi omide and other 
UNA binding dye: exhibit vhen they arc bound to da- 
13NA 14 - le . As cnitlined in Figurft 1, a prototypic PCR 
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ROilC 1 PTlndplc of j'analtao«oua at&pUAodou and detection Of 
PCX product, the cQmpon«nui of a PCS. cnciaining EiBr tha^ arc 
fiuort wot w» Uxtcd— EtEr us elf. ^tHt bound uu either saDN'A or 
dcCMA. There is a 'ar^e fiuorcAceiivc rnrwnoais^Bt'wbcn £iBr is 
botintt to DNA and biudinc » gr«wuV cnltanood wftea Xi(<A j$ 
doubk-jtrtnded. After «i&*«nt (pj cycles of £CR dtc u«c 
ina-*aa« m dUDNA raflulrs ia aoVSMonal £i£i binding and a a«t 
incr«wfi in total .fluorescence* 
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l Gel elcco«pHor»U of PCR assertion products oethc 



£tBr has no obvivua <ff*et o* the yield or jpcafinty of amplifi- 
cacioa. 
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nc&tt J (A) Fluon=<ence mcsuurcflacnu from PCP-s thai conolxv 
0.5 us/rci tcBr and that are specific for Y-chremo5omc rcpait 

CNAs Jjxci5ed. At ccch Elated cyde. One of th* fir« «p*yatt 
FCRs fu7each DMA uas removed Uo*. ^naocydMg and w 
fluoresce*:* meaiurcd. Uuiu of Ruorescairr are arWtrarr C»> 
U v photography of fcr «ufew <o.a mi Eppendcjf-jtric. Hrp£- 
pvifine roJcn>icowfug« tuUaJ contain^ "JJ- 
fau from £ aft nude UNA and ccnu ol ro*cw»« vxtrhoui any PNA, 



c*glns with prunere that are su^c-rrrandUrf DNA (>$- 
WAX dNTP«. and DN T A polrnrtrase. A* aoiouilt of 
dsDNA cootaininfl *e targer setyicne* (urget JiiNA) a 
also cvpicaljy present. Tins amount caa vary, dcpeadiAg 
ca mr applic^ion, from tin^c-cell amount <rt DNA to 
microgram per PC* 4 *, it fcC*r is present, the regents 
chat fluorocc, in order ofmcreaain? fluoxeiceiice. arc 
fine £cBr !t5elf t end Mr bound, to the ^K^w^ed 
DMA primers aad «o the o*ubJe.Strandcd tax** DNA (by 
its mwrcaladon bec— «* the stacked bases of the PNA 
double-helix). After the first dciuxxirauon cyde, target 
DNA wiU be larstJy angJe^andnl .After ft PCK. is 
completed, the most sienifceant ebaner u tJ» «flcw« ™ 
tKc amount of daDNA (the PGR product iudi) of up » 
several ralcrogTani5. Formerly tree EcBr « bow* to 
addidonai dsDNA, rcsuJtflg: in an iacrax ^ fluot«* 
etnee. There i3 also iome derate In Ac : amouAi ; of 
ssDNA pTim<rr. but became the buidai^ Of Mr to aaDNA 
u much lew iHan to dsDNA, affect of this ch^Jgc 00 
tkc coud fiuorecitnee of die 3«j»pk is small The auores- 
iDcreaw on oe mca»urcd by directing paianop 
iUurcination tKrougt the wzdb of Hit aaxplxfica W vc«3el 



before and after, or cveo condauously durinR, cKc^n^cv- 
cUog. 

PCK in the p/eeenco of RO»r, la ord«r w asse si chc 
affect Of EtBr £ PCR, aaiplificauow of cb* kuaaa HJ A 
DUat jrene 311 vrcro performed with th* dyd prescrtt ac 
eoncentradoM Irom 0.06 TO 8.0 wfai (• typiol conwju 
o^ioa of o3«d in *uinlrff of u«cl«c aods following 
ff d electrophone " " As 3hoMO ui ligur,: ^ € d 

Sc^ph^ rr^eled little or po di£feren^ ytcld 
^rqX of the Wfc<atta product whechcr icKr waj 
abater or procai at any ot iltfse concenuanon*. mdtcav 
inff chat KcBt doea not uihibic PGR, 

Detection, of Human V<hTDmotfom« «x>cl£c it- 
queaces. Scqucncc-rteciHc fluorescence «nWiocin*ac of 
Mr w a result of 5 C» detn-oMtrated in a teries cf 
Bmplihi^rions contain^ g 0.5 u-g/ml Etfrr <^d pntewte 
sDAcific to repeat X3NA sequences found on the human 
Y^hropiasoflie^, These PCKe uaidaUy con earned euher 
60 n ff mate. AO ng feo»J% 3 n 5 male burn^ or no DNA. 
Five replieace ?OU were for each DNA. Alter 0 
17, 21, 24 and 29 rydca of tliermtfcydifltf. a FGK for each 
DMA waa removed from th* daennocyclcr, and ks fiuo 



nuoracenc. can be drtecrod, the uureasc in DKA 
hecomxne li»«ar and not a ;pon^nt^ tinth cycle number. 
As shown, the fluorescence incrcaaed about Uiree-tolo 
over ^hc badc^ound fluorescence for the fcfia coaraia- 
in2 human male DNA, hue did not si^mncaiuly inacass 
for neeadvc control FOSLi. whicb contained cUoer no 
UNA or hutnoa femaH 3>NA- The more tnak J lis A 
pnuent u> bejin w«h^6U VOW 2 ng-thc fciver 
cycles were needed to give 3 aetecabic lo^a w floo- 
res«nct Gel dcCQ-ophoratU on ihc produc« ot these 
arftplincatiOQS aho^ecl that DNA frajfmenw of the ex- 
^Ll tiie were made in tbc male DNA co»o»W 
reacciont and rhac lUfle DNA syntUcslS took place in 

U additian. the Increase m fluorascence^as *iwaliWfi 
by simply laying the completed, unopened VCSa on a U V 
a ^Uununator and ph.^ogiuphiag chem ihrou S h a r« 
fUccr. Thifi is «howa iu fiffure 3B tor the re^noni thac 
bewin with 2 nff male DNA and tbo« with no DNA. 

Detection of specific allele* of the human ^;l*b» 
Ecac- In order to demonstrate that ihu appryjtfh ha= 
tdcquaee specficiry to aUr^ fenouc tcweoing, a Ucwcucn 
of die »c>ii-ceU ancmU lautauoa va$ pertorroed. > ifjur* 
4 shows the flaortscencc ftom «ompUwd aroplificanow 
concainmg EtBr (0.5 m/ml) « detected by phorograpiiy 
Of the reaction tub«s on a TJV transiUununatnr. Th«« 
reactions ^cre performed nane; pjAxnew specific for ev 
the! the vnid-rrpe ndtle-ceU mutanon ot the huma* 
SSobia ecne^.The spedfldty for each aHele is ^parted 
hy pfcdmr the ffickJ«-muunoD site at the wndnai » . 
nucteoOdc of on* prim^ By U5lng an appropriate 




a reacdSn -ich ^her the ^fld-type allel* .pecific ZfV 
cube) or 5l<*le-iO**U ep«ahc (right tube) pnmert. IWJ 
dirfcrent DNAtf vcre typ«d! DNA.froro a hocwyc^j-. 
witd-type p-eiobin indmWl (AA): frotn a h«wroiy«^- 
•kto MtoWn individual (AS); and from a homoay^ 

DNA w srari each PCT) anaiyaed la iriphcue (S p^ . 
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f rtnariens The DNA type vu raflecttd in th& 

c Itfivt. fluorescence Intensities in cath pair of completed 
I metrications. There was a significant iiwrcaaa in fltiorw- 
^cce only "here a ^globin allele DNA matchod the 
rimC see Vl'hcn measured on a jpectrofluorooieter 
Fri-a D0t sbowa), this Auorescenoe -waa about Oirea times 
thlc present m a Pt:* where both ^©bin aiM« were- 
mUmsccbtd to th* primer set Gel elccffophorosi* <«w»t 
ekownl ea»bl«b«d that this increase in fluorescence w« 
jue W tbc ayirtbcs* of nearly a microgram flf ft DM A 
fragment of the expected size for fr-globia. Thar- wai 
Kde ivndbe»s * f dsfcNA In TeatxioiM in vhich ine allele- 
♦periftc primer was mitmatcned 10 both afldca; 

Cofliinuoua o^idwring of a FCK- Uain? a fiW Ppdc 
U«vice Jt is po«siWe w direcL cxriiatioii Ulummadon trora 
'a 5pecn-c^oromet«ir to a PCR undergoing ^bermocydUig 
aaS so rerum ftuor««eertee to th.c spcurorluorozn«tcr. 
Tfcc fluorescence Teadout ai such an arrangement, di- 
r«cwrf 21 an E< fit-containing amplification al Y-chromo- 
" 3 om«5 specific sequences from S!o a$ of hlBnan male DMA, 
i- thewn in Figure 5. Tbe readout from a control PGR 
"rich no target DNA U «Jao shown. Thirty cycl« of POR 
vcre monitored for cock. 

fluorescence trace as a function of lime dearly 
shoves the effect of Lhc thcrmocyeling, Fluorescent imcn- 
; ^iy mei and falls inversely with wmpsrature. Tlic fluo- 
ii«c*ncc mcen&ity is suniraunt at the dctiaairadan tenti- 
iereture (9^Q and maximum at <h« aJUieaUng/eXtenalon 
umpflnrare <50°C). In tte nejpti^c^dtrol >CR. chesc 
RuorescffRCC maxima and minima do not change siffnifi- 
^e*ady over the chlrcy shcrmocyeles, indicadng thar ihrre b 
.t£tdc dsiJNA jynrhesw without di« appronnaze Tar^tt 
&Ma. and chere Is Jitdc if any bleaching of XtBr during 
(he continuous OlumkiatioQ of the eompla, 

In the PCK concainlnj male DNX, the fluonsscence 
eiaxiaia k the aimeaUng/excenaieai temporatuia begin io 
Ecrfflsc at abtiut 4000 sccoads of thonnc cycling, and 
conurtur: to increase wich time, injcUcadap cKat d«DNA ii 
bdnp producet L at » detectable Itvrt Ntur thai iK« fluo-, 
reaccncc minima at th* dfinaniratlon leropcrature do not 
iigm&andy in<Tea«c, presumably because ai thi* ic*ape^ 
«urc there is no dsDtNA for -ttBr to bind, Thua the ccure* 
pf the amplification is followed by rracking chc fluorai- 
;-ccnce increase at tho onneaitng tcmperamtc. Aaialyt^i of 
ihc producu of thce« two omplificatjoui by gtl clcciropho- 
jesia ihcr*cd a DKA fragmQat ot the expeaed. size for the 
•ibaie DMA ajneainin^ aajcnplfi and no detectable DNA 
ivnthcsu for the cemaol iampU. 

Downarrearo proceasea aucb as hybridkadon Co a 5C- 
qi^nce-SpedSc probe can enhance die tfpeciiiesiy of DMA 
,C;tec:iofl by PCIL The cUminaaon oi thes* proeetses 
.rasiat thax the spedficity of this homog«n*ouj assay 
'depends so/ely on dxu. of PGR. In dw case of cidOe>ceU 
idtiease, we have shown that PCS. aloaa bos sufficient UNA 
.sequence specificity to permit generic aer»«iin£ Using 
Appropriate ampKficidon condidoiu, thare is little non- 
5pcdtic production of dsDNA in iKe ab«<nce of the 
^Ppfopriice target ancle. 

.;. TKa sp«afidcy required to dcttci pathogens can be 
IPhre or less than that required 10 do genetic screening, 
fcpeodin jon the number Of pachogcos in tlac aamp]< and 
M ajijouat of other UNA that must be taken with chc 
'^ple. A difficult »rgwi is HI vr. which require* detection : 
& a v ir»l genome that can &e at the level of a few copies 
ft th*\mxid.s of bosi celli*. (^nmared with gcacaVc: 
^anvng, vhieb is performed on ceUS COncaining at ica*t 

^ ''?)* -copy o£ tht QaJ"g*E n«jU««», HIV o^necdon iraplfrw 

[#T*th more specificicy and the input t»f more total 
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4 UV photography of Pd M^cj eoncaioioff vnali^carionj 
wmrBtBr are spedfic eo -«l<l-cwe (A^ et su:We (S) alleles of 
the human fi-tfobin jrenc. Th« of wh pair of tubes eottcamfl 
allelC^apecihc !»unara to the vikUype aflefca, the ngfa tube 
prime** to ik« cickie allele. The uhgtormph w calten after 30 
«ycUo «f pCR. and the Input DNA3 «ad £e alleiea dticy connai 
are indicated. PUc/ tic of D^A w uecd to hcajn tC3C Trpms 
w^s done i« criplic«to (% pairs of TCRa) fof eaoh input DX r A 



f 




1 


Iff 


If 


tr 


t 


it 





25° C 



25 C 



20ng of mole DNA 



so 8 c 




94 C 



no DNA control 




2000 



ti 



4000 6000 



8000 



me 



S CoiuIcuiquc, reai*ttee monitwinjr oS a PCS- A &Ur opnc 

vu u*ed to carry exclwiiuu 2gbt » » p £fc in progress and «djo 
emitted lirfjt bsck w * fiu©t«*octer («rc Expenoiemal rroi w), 
Ampli£canoA usia^- human maU-CNA Spcoflc primer* in » PCK 
iiartinf "^hb fio ng «* human male DNA (cop), or in a towroi 
FCR without U^A (botiouj), vere oaeaatortd. ThlrTy Cydc3 of 
PCR were foflowed iwr eaob. Th* «wap*irawre cyded Utrrfoea 
94*C (denacuraaon) and fiO-C (annealing and externum). Note to 
tlic in«l« DNA PCM* the cydt <uwa«) <l*pen<io»t increase in 
Quofetcence ai tbe ajuicattoufcxsea^to MmpcratuJc- 
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DNA-ap w microgram amoimo^ «f «r ~ ^ 1 

starting DNA in « amplification j^eamly ^g^f 
l^tground Ruoretcne* over wiucix aay *Wftk»ral 
?uoSrp^ucea bv PC* muse bo 
*ddido Iia l complication tnai occur* T&*Z^$J*i- 
ccpy-nambcr b the formation of .he "pnflutr^rwr 
amfect This is the result of the ejsxcrwion of oun primer 
S^cScr primers »«npbte. Although th* occun 
infrequently. o5cc it occuw rhc cxtccMoc product U a 
aubaraW fbr PCR aimtfieacton. and a* compete with 
tnw PCR OTgcu if d>Q<* xargecs arc r»ft. ?he P™ncr- 
dirner product is o£ couwa dsDNA aad chu* w * potejflaal 
30 ar« of false signal in thi* homogeneous assay. 

To irvoreasc PCR ap«oncity and reduce th« effect Of 
prinur-durer amplifioiioa, ^ IttvettgWg a nu£ 

of approaches, including the use of 
*npKncaDons thai wkc place in a single wb^ and. tbC 
**oiW\ in which nowpecific wplAma u reduce* 
by tniains tin irapeivure of *c reaction before DNA 
synchois begird. PreUminary iwuJB^ usir^thefla *p- 
proaefaea tune* chat prirocr-dimcr * eff^vdy x«lu«d 
S3u » poSShle to detect the ineraa* bWr. fluores- 
cence in * PCR «sd*«*d by a sing* HIY gaum: m x 
background of 1U* celt* With larger -ranters Of cdb, ^ 
baeXmurd auorcccence cooiribuicd hy?ffcoona*;DNA 
become problettunc. 1*0 reduce An b^kgtouad. itmy 
be ooMdbSt w use ifcqufincc-spcofic DNA-buiiW dy«- 
thai can be made to prlfereimally bind J^^P^ucc 
genomic DNA by incorporating Ac dyM^ng; 
sequence into the PCR product ihroujrh a 5 add-on to 
che oliKonudeodd* primer* 4 - m r a 

Wc hjivc «How» that the dctecoon of fluorescence 
Kocratcd by an EcBr-oonulniag PCR le Btraa^btforward* 
§oih uncc PCR » complied and wndftuously dimng: 
thermocyciuig, Tbe *asc with which auwiaaooa Of spe- 
cific DNA detection can be accomplish** is the mofft 
promisee aspect of this assay. The flucracance anaiysa 
of completed PCR* is alr«*dy oo^ibk with «uftong instru- 
mentation in 96-well fontu^. In thia format, tbe auores- 
caooc in each PCR can be qiunTirarcd before, after, pel 
av«) ai Selected points caruiff thcrmocyoin|by ; mov\a B 
th« racfc of ?CSj co » 9b-rmcr<rwcli 'phw Kuorcscencc 

Tht tnstrumcru»ti'on no«*sary to coottouousry morutor 
muluolit T CRs fllmuic««<ouftly is also wapjeifl^pnnflp*. 
A dix*:t extension of tbo apparatus used hcre^ w*™? 
multiple fibeioptJo tmnxnut the eXdUoo^b^jmd^Pti. 
oroiwee emissions co and from multiple FCB*. Jb« 
to monitor muliipU: PCRb continuous^ may jl^w qu^La- 
uiadon of tzrtet DNA «py number. Ftgvrc 2 shows mat 
th« larger the amouat of s«rtii>g tared DNA, co oaer 
duripg PCR a fluoresce increase Is detected. Pre Uni3. 
nary experiments (Hipichi and DoUinger, «iLau«cnpt in 
*r«>ir»iion) with couanuoiAt monitoring havo ; 8itown z 
jcwUivicy co two-fold diffewnc** in iiiiuai wrg«t DNA 
conc<in<radon. ^ ' ' . 

CoavcTt«Ly. if the nvmW of target molecules 14 
Uo*rn-^* n can be ia genetic ecTeeoing---conQnuou* 
monitoring may provide a means oi defecting' false pos>- 
dvc nagaavc resuits, With a number ot 

carprt molecules, 1 true positive would ^^^^Sif 
fluorescence by a predlctabLc nunabor ot cydes of PGR. 
Tncrwecs io ftuorstcence detected bef or« or after that 
cycle would indicate potenual arufaeu. PaJsc ncsauv* 
rcsulcs due to, for example, lnhabitioa of UNA polymer- 
ase, may b«: detected by Including ^thin each PGR an 
lncfFidendy amplifying marker. This marker > res UJiS^ in a 
fluoresccivte incr««« oftly after a large number of cy- 
des— tnaiir more tkaa are necessary to (Utaet a tuc 



poaitive. Xf a sample ftdla to have a fluorexeac* **cc*ls* 
£te this many cycles inhibition msqr b "^f^ rp ^' 
in tbl* away. coaUu^iopC ^ dravr n ba«d on ^e pr^«i£« 
or ab»nce of fluoresce fl«nal alone 
be imoonanL In my «*cni. before wy teet based on thi» 
otoicSa is ready for <2m ciirdc an a^«r^nt of xta fais,. 
potSJerts^C ncjjadw rat« war need to be obtained using 
a lars« txurabCT «f Koown «aiaplca. 

r-eJ^Tis ctiameed upao bindin* di»NA nates « 

Sxe PCRuiSe. In »he fuww, «nsmimeii« t«cd upoi , chd. 
orineinU mav FaoEtate tW morr. wid«pr«»d use Qf ft.* 
E» a^plSss to dcmarul the higf throughput of 
jaiaples. 



Sh«AS0^ « *M MAi « u^U of 7W BMA 

CH26 and — 4 r 

Sroduci diluted from a previous T «^ uw "- — , 

Sr SicwA) ^ used * the omcenrranoti* lD^c»«a u i Figurj 
l5*ScUnr iffoc-eded tor W cycle i«» nodd 4x0. 

c^prcKr*m of for L cUn. danatarazwn and e<rC fbr 50, 
c^knftftttlui0 and 72*C for SO «<- exccnsioa. v 

YlTa«*Y iS 8 and e££ 50 ag male, 60 oe fc«al«. 5 n VK5? 

fnr l uilu. iiAina m w stctxyclc prceram- Tha numoer of cycles row 
^LLTu w^aa SSScd iTtSut 3. Ruorescence Miurc, 

100 ^lvoW u£iM 0^S ptffapl ef Mr were ^eP^JJ? 

teriobin spedfe primer^ u< 10 ( ^oU ead; i pftaei 
S>W^e dcrdo^ by Wu ej ^ { 

3«n»a KfltcPoTyflOUS for the *tUe trait (AS), or DNA ihM *« : 

W» nal ™to wvads sUek.3pecifie amobtaoor- So*?\f*^ 
Ifwr p\ac^e thcl^aioT. rito atop a »udd TU-W rranWDum^ 

Spex Eoboo. >?D- £«=ittt*on »ra* ^ <b* SOD cm band ^jn> 
SEfi n&d^dih with > 4» am cut-off a«rjM^| 
Irvtne, CA) w w»dud« sccond'Oidcr Ubt- tm^ij 

o^i^&m^ was ued « -^^S^ 

tp^ofluOTOincto- and «*tongfl deaenbed ab<^4 « 
ligllt t». «ad reeavc emttred heh t from. » ?«TK "_j„ eI1 j 
rfihe fiberupoc caUe w atUchefl with "S mu««*MP«jr J 



frjmw" W« «3 th« rocn Ughu l<«pi djmn^ *£g 



wm cowed *Ah rnujcni oi\ (2 diops) to prevent W*Li#! 
Thcr»«yd«g and flUOtocewe me«urement 
multwwSSy A nme-btisc scan wjdj t ^ second in"^© 11 .. 
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sCD-14 ELISA 



Trauma, Shock and Sep 



fTfta CD.u molecule is fixpressoc) on the surface of |SK offers zn B.I3A for quantfmtivo determinafon o( 
manocytes and seme macrophages. Membrarie- 1 - , soluble CO-14 in human cerum, -pUisrna, ceijl-culture 
■' bounci CO-14 is a receptor for llpopoiy^aa^iaricle .p3U^rnQtants arid other biological fluids. 
['"(LPS) compiexed to LPS-einding-Protcin (LBP), : The ^'.^fi[ay"Toaiures: 1? x 6 detcrminatianfi 




conrRntr? ( tiQn of its soluble form is altered ^updaf 
certain pathological conditionfi. Thara Is evidence far ' 

i; < w (mpoaant ro<© of with porytrauma, eeoeie, 

burnings and inflammations- 
Ounng esptic coi'dlilone and acute Infections it seajTW 
ffj be a prognostic marker nnd is therefore of vdus irv 

./-.rnonitonng these patients. 



(rniorotiter stripe), 
procoared wftn a sp/scific 
monoctonal eno'body, 
2x1 hour incubation, 
standard range: 3 96 na/ml 
detection limit 1 ng/ml 
CV:Intra^ and interassay < 6% 
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Oligonucleotides with Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kennoth J, Uvak, Susan J.A. Flood, Jeffrey Marm*ro, William Giusti, and Karin Deetz 

Poridn-Glmcr. Applied Ulojyiwrm DJvULan, V«aier CUy, CaUfonUa 94*04 
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The 5' nuclvac a PCR ***«jr dUtocto the 
Accumulation of tpecifle PCR i»rod**«& 
by hybridization and cleavage of « 
douhle-lahei«< fujoroatinlc probe 
during th« amplification reaction. 
The probe I* An ellaonudeotJda with 
both A reporter *lut>f**c« H| dyo *fkI * 
quencher dye attached. An In 
In rftpanar fluorescence Intcnirty In- 
dlcat** that the probe Dm lyrhridiied 
to th« t*rd,*| PCR product mJ fi*> 
been cleaved by tJitt S*-*^' nticla- 

olyUc activity of fro, DMA poi/m^™**. 
>n this tftudy, prObu with the 
(|U«n«h(tr dy» ^ttath«d ta nn Internal 
nucleotide wore «6mpir«d urttt* 
urobe* with the quencher dy+ at- 
tached to the 3'~end nucleotide, in *H 
caici, tha reporter dye was attached 
to thA 5' end. All Intact probec 
(Hr>w4d flenching of the reporter 
fhioresc ence, In general, probes with 
the quencher dye attached to the 3'« 
o«d nucleotide exhibited a larger sig- 
nal In the *' nucleate pcn nuay than 
the Internally labeled prober. It Is 
proposed thai tka largvr signal U 
caused by Increased likelihood of 
cleavage by Tqq dna poiymertuc 
wh«n the proha It hybridize*! to a 
template »tr*nd during PCR. Pr«b«» 
With the guenUier dye attached lo 
J'—nd nucleotide also exhibited 
an U r-eparU*- fluorescence 

fnWntUy wKm hybrid |»*cl to a conk- 
j>lcnienU»7 oArandi Tin*** otlae^ucfo* 
ofldes with reporter and qv«ncher 
dye* attached at oppoilfae end* can 
be UMd •« homo<a*aeoiif hybrtc/lxrf- 



(hi. mint mixtion of specific PCK prc>tf- 
act ih«( up« i| <36uDli>iaUclcd flttoro- 
(Cceic>rwhe w*« d«wc ilx.-tl Uy Lc« etal.* 1 ' 
Tize assay e*|jl»itr the S' ■ * 3" nude. 
o)yuc " activity «f 7«j J)NA poly- 
•Jitijtttf*'-^ uhd Is (H^jprdnjcd In i1«orc K 
Tlw lluoi'Ogfcnli; prulJtt t*CHl^ltfty Cjf an oil- 

£ftriud<KvUd*^U)i u wjwrtw fluorwcent 
Oyn. ^uv li « n fluyiybwlu, attached to 
life 5 ' cjtU; uuO u quciahcr dye, jvdi o5 o 
rhoian^inc;, auac^icd Internally, When 
tUu t|ucirwc<tin |e excited by (radiation., 
lu AvfuTwccni cmiislon will be 

quenched ii die iliuOattiitic li ClnH« 

cnouitt to be .ejneftcd through ih* |V<»- 
ccis ot : duoire.<c^lIa , vnvryy txan^tftr 
(l-l>r) M ^ IWins PCIt, !/ ihe prolx li »iy^ 
bH<<ir!ed to a tempi aU *tmnvf, T<#^ DMA 
pprymcrwc , will cleave Oie probe b«- 
ciuseipt Us inherent 5 J -» 3' nudcy lytic 
actiVfly- 3f the d«3v^e cccurs between 
Dhv fiuurescem and rbodariiinc dyo* # « 
on incicw Sin £tu»iosvvin fluorv- 
ccncc imm^ry became the fJucvrcAc*to 
U uo lonuer ciuwricbcfi. The Ineieasc in 
{luuecsccin (luctrcAcence inteiuiiy IntH- 
v«tc« » tial liie firobenpcdilc i*T.Tl ptoduoi 
liaa jj6hv»trted. TVimk # PQThebtf^an a 
lejHijiei'tlye and e quvrlU'htr dye Is ttriit- 
caI iiD Hie performance rjf Hie jJiuUe Uj 
Ibe 5 f wuvlw^w 1*CR 8\*My. 

, Ov« ni, ^* TI S <^wplct«\ty depci^dcnl 
on »he plty^;^! : i»*o*U««*iry u/ tbp Iw* 
dyti." 0 Rccouic of this, il lu«a buuit 
i4imcd the qucnclicf dye mu>t be 
«lUc)(ed nea4 the 5' uid. *vaninslut;ly, 
hove Touiid 4hfti ait4<4*l rt 8 * rfia - 
daWiJitc dye at the 3 1 viid of « p*wl/e 



t»Crt. ft«»»y. IniriUffmorCi clcavaRc ofvhtj 
it^c of proh« i?f not required to jicniew. 
jomc rcaoctlon In cjiiefiCltlflg. p}i^unu- 
dcoddc^ with a rcponr: dye on ih« S' 
rind and a qtwncheT dyr ar< the 3' end 
ttxhlnii o mucn mgtier repoTtrr jIuo/pj- 
cvuvu whun dnubJe*£itaiiti£U as com- 
nurecl wUh im>jWsaoiidc<2. 'I'Jiii jt^Qiild 
make u pcwiWc 10 use tni> type of dou- 
blaiUbeled. probe (Of nomo^encous etc 
u:eti<m of nucleic acid hyhnuftatlQII. 



' WATC RIALS AND METHODS 
0l)9Ofiucf«0Tld«t 

Tiiblc 1 »hows UiV nucleoddo eoquenta 
Of He cllKcmu<;icotlde» u^cd In this 
study. LUOcer arm nuciec«(dc (LAN! 
photyhornrntdHC Wijc ohnuied t/om 
Oicc Rtacweh. 'IlicsUrjd^fd 1>NA phov 
pharamtditcs. o-carboxyfluoresfiln (<U 
PAM) pbospnr>n]i3idlte, rt-ca;boxytet* 
Mmethylrhodamlnc sucauimuiyl ejier 
(TAWPa MKS rttcr), and I'nosphnHnk. 
for all aching « -blocking phn^pnate^ 
v/rrt oMftiiicd in»m PflrUln-Blin*:, Ap. 
pUwl oJojyiicms UMMon. («if;onucle« 
6 tidy synches^ was performed using an 
AB1 oiodfti 394 riNA $ynmcti2ei ^Applied 
ttUMyswm*). ?rimci Ai\a compiemenc 
uUivnudenodes were purUica wing 
Otf^u Puitflvatluri CW(rldx«> (ApplUd 
BLo&yi(.cina3. 0^ubIc-inlit:it;U ftfnln:3 t*ov 
.lyiuhvsbra with 6- PA M* labeled phov 
^Itit^iuitlt'UciiUlcS' *fid, MN rcplirct7i^ 
o»t: of Hu'Tj In the <tKpienc<» ( end J'hm- 
phaJink fli ihc 3' end. Hollowing Je- 
l«nirttu»i iini cHiariuJ precipitation, 
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FIGUAl 1 DHjnm of 5' nuclease assay. Stepwise rrrpjejerUKto-n fit trie 5* — 3' auctwjytfc ac- 
tiviry or Tat] una polymerase anuiK ira a tfuwuwnk jwvfcr durhiK one vMvujsutj nlm* pf l l VM t 



IWH Na4ttCfirt*0l!&lC \5U((Cf <pU 9.0) m 
room ccmpcMlurc. Uurcacicd dye wnj 

de* column, finally, I be double -]<i be led 
pry be wo a purified l<y preparative hi^h- 
jjeTform*n« licnikl chTomttU)jfcaphy 
^IJVUij using an Aqu»(iorr C M 22UX&6- 
mm column ^(th pflrticl« «e«« The 
column was developed vrJih 4 24*m)ti 
llutttir gradient of 6-20% accioiiUrtti: m 
0,1 Kf TliAA (hriclhyUlTlilH* <ic<itaie). 

>/ahei at4 named try dcugnaUng th« ««- 
<\ucucc ftom Tabic 1 ftivd the poaUloil ©X 

The lAN-TANJHA raolcry, l^r cKompJc, 
probe Ai-7 his sequence At vrJth f.AN- 
TAMRAat nurlcotidnpojcition 7 from the. 
.V in<i. 



AH I'CH lunpMficnion* were- performed 
vi\ xhc Ferkin.Elnxor CcneAmp Pl^K Ny*- 
tern iifcuO u&ln5 iU-nJ *cacUon£ ihul con- 
tained 10 n\M Tri.vHOl (pH 8,3), So iiim 
KCI, 200 p.M 0A1T. 200 ItM dCU\ 2U0 jim 
df;TP, 400 (U4 <iUTP ( 0,5 unit oi Airier- 
&ic uracil fMvcosvla&e (PcrJcUvW mer). 



jxnc (nuckuUdca 3>141-&43S in Lhc se- 
quence 0/ NakaHina-lifiiiia d iU) m wai 
«ai^jUtl«d using, fjiiiiico AM* aiitJ AiU* 
(TabU 1), which w modlfUd jM$HUy 
fr<>m thew «/ <iu Crcull et nK f, ° Actin am- 
plification reactions cemotnett 4 iinj*» 
20 ot human genomic J5NA, 
SO nu Al or A3 probe; and JUO am etch 



TA£1£ 1 Sc^ucacv? of OtigonurlcoiJdo 



(2 mlfi^, (10 mln), -10 tyclcft nf OW 
(20 atfli 6QTC (1 sn\n). «nd hold «i 73*0, 
A 51S-b|> «egni»jjL wiii aiiiplf^cd truii) a 
pjasnild that convitU <#l a segment ol \ 
UNA (xnrclcoUtkfl aZ^O-a?.,?-*?) in- 
serted in the Jmoi Kite 0/ v«ccor pVCl 1?. 
'L1l«BC CWtloiU UilllilHUUl X.A itiM 
M S <:t 1( t «{) o< yJufiifUd DNA, 50 flM P2 Or 
?5 |irtibc, 200 i*M pdm«r VI And 300 
jam pjluioj R1JS>. The ihtrmitl te^lmrji 
Vva« S0*C CZ min), (io mm), 25 <.y- 
dcl ^ (20 *cc> ( $rc CI mln), luul 
hold a\ 7S 9 C. 



Blunrftscesicr Dqtoctlnn 

Vor ctioh ArapU(U*atiu» rooellon, u A(Lp.t 
aliquot cf n sample iratii/ef red to »n 
individual W«M of a white, S»6-w«H m(cr». 
tiHr pluta £P*Aln.if\mcn. nuorwtcnct 
was motturod ati the Pcrbui-tJmc/ Taq- 
Man LD-StUl Jlyisicmj 'which ct#mMau M 4 
lujriincsconco oiHKlrom«t«rjr wUh 
reader iiaictnbiy, a 483*nTT» twcttaUon 
tat, And a MiUnm emluinn fitftr. facia*- 
Hon wtu at 4BR nm tutn^ a 5-nm lilt 
widvh. tmfjslon was measured at Slfi 

Am for fi-)*A,M (<hr. ropc»r<cr or H v*l«c) 
And nm forTAUHA (t^c quencher or 
Q v*k*t) vm'ng a KUnm «IU -width. TO 

ilsjn thai h caused by dcuvo^v o( the 
probd dvfln^ A'CK. three n(rnn,i^/jitIona 
axe ippiled lo Che law «a\ii»Uiii duo. 
¥its\ t cmbiion imcnaUy of a buff«i ljUnk 
Is sub l con led endt wivcicii^tli. 3tc- 
oi>d, emUxton tntensltyo^ the importer la 
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CUQlpl«III«Ml 



A<XCACA<JCJMCTOrtllXAt;CACrc 

ATG'i'pu oorrccooar.Mxrnm tic 

m CATCCI lC TCA.'fACATACCMjCAAA l CCC 
T^CO\O\aOT0CCCATCTAC0A 

1 wc xjyHAi ;aicTi>' 'Towvwrou 

A£WCCQCCCOCAl'CC.CA:iCCtvC0 l^J 

At^t :i;cAt;i;A7x;t ;c^*n;i;c;tx;A< :<;< jc^ta^ 
CGrrm rtcACrr^ ACCAAOAcAxv 

rrjkTTTcnocTOiAAaTOCAoanncuc 



Tof c3cn omoriuacutiuc ust^l In Ihto aludy, ihc nu<JKic add scqoencv U Hlvtin, written in m'c 

v . .lir^nw,,, r)knrirr ikitiv* ivn:« n{ uli«««i\ic3rtli4«: TCJi pllflfier, fluOr<ifl«i)e probe oSW^ 
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fiCU&t 1 Pt»*Iu vt f ' nud^w *»my tiM.tparing F-KJUA prober With TAMRA ac different rvcl<- 

#ti<l« po*Hirtm. Ax dec<fib«d In VfitdfUh and Metttxtt. vi'ii tmptiliiiiuonx annmnlns the Itt- 
£c»ud piotflt wvr« per6otm*d ( and the Hu<ir jjU'cucv emission was measured at 5 lfl arid 362. nm. 
Reported v (Kit* BA-thc avenge* 1 (ar itx reaction* run witiioui aridctd template (no icmp.) 
and six feeciloni run with i«mput& ( i tgmp.). Thy xcj raiio Was calculated for w»cb indtvictv^i 

rt«ct*oA MHl avcratjud to jjivc tlur wparled. (tQ" «nd Ur^* VnlutSL - 



award by the embroil jcicvfijfiy vf<lm 
qucncUec ro gjve an r*U» Cur co^li 
j actio a tube, Tills notmaUzes rorwrll- 
to-wcH variations in probe mnfl«rncra- 
non jno nuan:5cejace mtasurcTDcnr. J T 1- 
» nany, aR\^ in caiaiia;ea oy subtractta^ 
M\c KO v^iue of the no-tcmpl'dtc t,V0U0l 
fRQ") from Oic HQ vaiuu ^Jf llttf cvm- 
plc;c reaction indudlng i«mpis;e 

USULTS 

A srnn or prct>« witli IncrcrvMnj? dn- 
uaces oerwecn tne nuoreivcwi jl^wicj 
an J rliodainhia (5 u anchor wer« tcatvil to 
investigate thy minimua and ma^^Mim 
Jpannj mar would jlvc ftn ncccpmblt 
performance in the 5 ( nuacnar I'CH a.v 
«y. Tnese probes hybridize cn a tar^ci 



Ae^ue^tc in Itoc hunaan p^acrtn grnr. 
JlfcUic i fthpwfi ihV icmJta of *n eKjjorl- 
mcni In whl<h tKcx- probes *«eft In- 
ciudod In ^CR ilvif ^mpUflcd a jc^mcnt 
of fiK ^^ll^^i^tiibiiiiUis ihc laigri 
3vg\*cnwr- IVifuriiiKuw la Lbc S' ai^< 
Clewe PCJl «3Miy is monJiorcU l>y Uj« 
/na^nlDiric cf AKO, which Ira mcaaurr 
01 the mcrmiit; in rcporu? nuortrstwivc 
c»iMcd by PCK Apipj location c«I the 
probe (Urgt'K ?rub* A2*2 Jka < 4UQ v^Jue 
ifm IS ClOJf TO jxru. Indicating \hal iht 

jafobc wai uoi cleaved appreciably Uui- 
Ing the MttpU/K'»lHiu irovlum. Tills lug- 
KC*U lW with ibft <ju«npkex <Jyo on tlW 

>ccm\d nuclcoilcic fmm the S' wti r i;Ucft 
b iiisurik'k'nt mum fvii Tj^ polymerase 
to vleovc rlflvlditUy bct*wccn Oic tcporrcr 
aai) qvvncl^i. Tht- olhor flue prohei e«- 
h\httcd cumysrablo AR<i valuta Uwl art 



clear)/ djffcicnl from ecro. '/Vu?*! flvo 
fwohc* trc hcTDK cJwrvcd iIua^h l'C;k ^ni- 
]>)lf)uiULiit ic^uMini; hi b ahnilrti lucr^dsc 

noted ibnl complete di^non of a proh» 
ptoaucos a much larger increase in rcr 
)>Oftur auo£4tsccnco lUao that observed 
in fi^v»« 2 (dau noi skown). Thvi*, even 
In rvdclkrTO vrlicrc mnplifiialiun occur* , 
mo. rnainrity probe mcilcuulct) rviuuUi 
uiuJcivcd. Ji is mainly tot HUJ reason 
ihflt lUe flxioiowcnco Intoailty of the 
queachcr dye TAMllA ctianftci little with 
ampUftcailtm nf the larg:^, Thti la wh*l 

aUowi to u*t« rhc iOX-nrn f»ut»fcsccn<T. 
reading ±f> h rH>rmiUautl«A Jactur. 

mninJy on llic gucnchinii vffliJcncv in- 
her«nt in tne sp«dfic .^micinrc ol the 
probtt *md th« purity of the oligLUiutfe- 
o4ide. Thus, tb« laxs^* ^Q™ vaK>« Jfldl. 
caie Um profits Al-li, and 
probably nave reduced quenching 
vt co/nparao urim /\K7. ami. me oe^rr^ 
of <jwc*ixhlnj U syfJicknt tv cictocl ,u 
ht&hly M^fv»hcant in«tre»«w Ir reporter 
fiuciroccncc when eaclf th«fiit prulte« 
is dcavtd during TCU, 

Tft further investigate the Ability of 

TAUUA <in rhe 3' end to qunncJi fi^l^AM 

on Oar 3' cud, Umx rvddilionnl pnln> o/ 
probel were tesved In ihc S' nuclcasL* 
PCR oaiay. Fen «»ch pAii, ooe pxobc Iias 
TAMPA ottoobrd (p rtn intor<»l nutJi.'* 
ntli)c and die otWei Uw. TAMIU ^lUclucd 
to the 3* tnd «\ideotldt. The tcaulu 
tUowi» iil Table 2. ?0( ail (hrcx set*, the 
probe with tbc V quench tr exhibit u 
ARQ vnlviv ilwl iv cor»iU«ra(>)y 
thai* fvr Uk probe whh ihc In-krrnJ 
irjuencbcT. The M\' value* sng^wl thai 
<li{r«rvnc«o Ln qu-«nchin^ arc r<u( us ^«-ijr 
«a Ihosc ubaxTvod v»ith >omc »f th« Al 
pTobc.1, T1ic« reaulla dctu onfilrftle IhOI 0 
queacher dye on ihc 3' cod at an oligo- 
rwcic^t^lc ran quench efficiently the 



TABlt Z KesuitS Of NucIcam: Assay Compering ProU<w ^lOi TAMRA Attached to an interna] or 3'-tcrmin«l NuolwiUdc 



mn 





I*roh<» 




+ temp. 




+ IW0]3. 


net 




AKU 




A3-6 


1 3.2 
lt,\ ± 2.9 


236.5 a. U.1 


11(3.2 i 0,4 
^,2* 4.0 


17,^ ,« J- 2.5 
00,2 J- 3.8 


t).i7 a. (Ml*/ 

cm -i U.U/ 


0,73 i O-OX 
2,6e - 0.U5 


i:/6± o.o; 




^Z7 


62. B x 4.4 
113.1 = 6.6 




lOi.1 3 <3.1 


120.4 * 10.2 

lia7 = *8 


0.5? i oi>a 


* 0,16 


t 0.10 


*u. 


1*5-10 


77.3 = 6^ 


2MA a )5.d 
.133.^ a 12.3 


11*1.6* li.T 


94.7 = 


0,B9 - 0/0 J 
(I.A3 i 0.0?. 


2-SS ? acv5 


1.60 x O.Ofi 
2.89 i 0.i:i 
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fluwtiscenrr r»f a mpnrtwr dye- on the S 1 
outl 11 tu degree of quenching U tuffi- 
cieul f^r Oiid (yyc of uU^romiciceUde to 
be uxed «s a probe in the V nuclease PCU 

To text thu hypothetic that qutwhing 
by a TAVIBA Uop-ofKia on th« ftcxiUli+y 
oJ cha uligomiclcoddc, fluorescence was 
rncawmnl Tur probes in the stntfle- 

SUSA^ed and Ouubltf itrandtirl ctafe*. Ttt- 
hla it fcpo<U (h« fluuiwcenvo •dhfiftrv^ 
at Sin and ^82 nm. The relative decree 
of quenching la Assessed by calculating 
thr RQ ucU>. iVir probes With TAMRA 
A~)Onucleaiidaj frem lh& S' *jiJ, tliurv 
Is little difference in the HQ values when 
comparing siuEie-suandv.d wuh doutle- 
siraaded oligonucleotides. The results 

/or prnhct with 1'AMRA ft (rw 3' vnd are 
much different. For tha*« probee, by- 
briducatlon to a complementary strand 
cnusca a dramatic Increase In HQ. Wc 
propose that th<s Loss of quenching ss 
caused by rue rigid structure of doublo. 
Uftnded UNA, which prm/nnf* lhc 5' 
firtd3 J ends from being In proximity. 

When TAMKA is placed toward ioc .v 
find there U 1 Di^rkeid Mg a 1 effort un 
ouenchln^. Figui* 3 *h«ws a plot of ob- 
served KQ values fur thu Al seilcs of 
probes as * function of v^ 1 ** concern ra- 
don. Wiih TAMKA attached ncftr th$ S r 
end (nroh* A1-2 Or Al-7), the UCfcualue 3| 
0 niM W^" 1* only slightly hifebc* than 
RQ at 10 iah Mr*'. 1'or globes AM9, 
A1-Z2, and Al*Z& the *Q values atom** 
Mg*" art very hwh, indicating x xoucb 



induced quoncning cfriclonvy. Pur eneb 

r»f these jKobei, Uwiu la H marked do- 
tjrvaac in HQ oL 1 mm Mf* 1 followed by 
a gradual decline as ihc Mg* ' umccu- 
trutian inrrea&at to TO niu, Piubu Al-14 
allows an intermediate ItQ ^a)uv at O mv 
M$ 7< -w54U-n ^mduol decline bi fUiiUJ 
Mr** coiKciiiuuiHis. in a low-sail en- 
vironment with no w^ 1 n jnesent a *w- 
gU^hiraiuUsl^oJl^onycWolJdc »rould bo 
exacted -lo *dcpl am vAiriuIvd confer- 
iftaiiOfV because of cKctrorailc rcpul- 
jlqin / The biriiiiixg of Mg fl * ion^ nc%s to 
^Ucjd ^ at mc phox- 

backbone r p»"ekar ttie oil^nucJw- 
otidc can adopt conform ailonj where 
the M' cna is close to the y ena, There- 
fore, thfr observed Mg= ' effects suppon 
the notion thai quenching p\ a 5' re- 
porter dyo by TAM^A at ur nur the 3' 
end depends col the flexibility of the oll- 
fcanucleorldr. . 

DISCUSSIOH 

Thr jtrlkint hndlnu of this sludy Jx thfli 
ti . f€ems i)\e rtiociamJoe 07c Tamka, 
placed at any position in an olifjQnucJc- 
etidCi eao Quench *Hc Ouoroceni cmly 
jinn of ii nuortsccin CO-l J AJvt) pUwd a* 
the 5il; : I^P^ tnat a s'aftk- 
xtra r^<-a, ; dcxubU>Ubol(-d oU^onutfle- 
otid^ must b«* M»U t» adopl conforma- 
tions vrlwr* tha TAMHA la duac tt> rhc 6 
ejid. H should tw noted that Hie ducay 0/ 
<S-!'AM in ttieixtlted sialc requires u ctt- 
toln Amount of time. Therofora, whaf 



TAfiLt 3 Comporl>oii oi flunrc-.%ceiKr P#nU»^w«).wjf ^iui:lc'^Trondcd and 
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05) SlnglcMirandrd, Thf fluurvvcrjice ;miwiom i\ Alfl ut 5S2 nir for .mluKnna mntfllnlnK 8 final 
\\)ncemrfftlon of 50 iim iiullcainl probe. 10 mM Ti«-lia Irl! 50 mwXa, and 30 mw Ma^V 
(40 D(iublc-KCr&rdrd. "I^ir, pnluilcmS CCinv«inefl, IrvaddiliCUt W nM AK' fur prnhr* AV7 »nd 

ai-?Xi, 100 n« A3C for prober A3-b and av-M. 100 tin l*i<: for iirniw 17^7 jinu rZ-7J. oc 100 om 
rsc /ur probta woo and i«s-iH. ncrurc tor. aoc»i\inji or m&Vv jk»i« e^eaaj wmiJie wax nested 



nvittarn Tor cjucacliluy U nut Ur- 'j^crigt 
diethnctt bot*wccn <>*i*AM and TAMIlA 
tout, nthci, how cioic TAMKA can fet la 
fi.i'AM rtuxiiifi die lifhtime of uic q.i*am 
oxcitidititc. A& ioaft ~> die Uway tdiiu nj 
tiw excind autc *5 relatively lonff evin- 
pared with the molecular motions of the 
oiigrmucicotuic. quencnlng can occur. 
Thus, we propose toat t AMRA at th« 3 ! 
end, or Any other peridot), can cjMeudi 
6-FAM at the V end because fAMrVi is in 
T>TOx(jnU> to (i«VAM often enough to br 
able 10 accept energy Transfer frurh an 

CXCltCd ti.J ; AM. 

DqiaJlfi o/Thv ClucmctyK* measure- 
rocnb remain pvizcllufe, fox example, Ts- 
ble 3 shrnvs that hyortdljration of piotes 
Al-26, AEJ4. and l^-2« to their complc- 
rnentary ^"d* only causa* a hrfl* 
incrensc in 6-PAM fluorescence at 518 
11m but also cei use* a modesi Increase in 
TAMHA fluowwena* at SRZ ma. If 
'jAMtiA is being ezclttui by mecgy trans- 
fer from quenched 6-VaM, ihen loss of 
qtioaohinf; attributable t» hybrldlsaUon 
should cause a dccreaNe In the fluores- 
cence, ernissicm of TAMRA, "me fact that 
the f luoiescencc amission of TAMRA ln- 
crefuci tndloalu thux Thp ^Uuailon U 
morp complex. Knr rtample, we have an* 
ecuniai evjoetice thut Ta<r oasos of rhe 
oligonuclaoTiritf, especially Ci, quoncn 
tl\e. fluorcxuticu of botli and 
TAMflA to 5ornc degree. When cJouble- 
stMiidcd. basc-pal/ing may redurr. the 
aonity uf The oases 10 quench. The pri- 
xjiwy factor causing \)u quenching 0/ 
0«fAM Jn on Intact pcobc « \hc TAWBA 

ctyc T'.viacnce fa* Uie imponont* of 
TAXinA it that 0 if am .nuuiv*Lcmce 

icmuM* r^UUvciy uiiaianycU When 
^iob«* bW«U only wich UA'MA are u«cd 
in the $' nuck'OJC PGR fl»aay (<lala no* 

5howju. .ietX>D0«ry effectors of Audio* 
oencA both before and afiei cleavage cf 
th» probej ivccd tn ho explored further. 

Regard lesa of the physical mocha* 
nf.im, rhe rdatWc inuVi>cndcncc of posi- 
tion and quenching greatly jlmplUlea 
Hie dad^p »f ptnhef for tne 5' nuclease 
I»CR asiiAy, Thcire are three main fictori 
that deKvmlnc the performance c*f a 
double-labeled fluorescein probe In the 
c' nuclear l*cn o«ay. The (Uzi (ticUa 1* 
the dearer of quenchin H ol«K*iv(»d in tht 
intact probe. This ta charactered by tht 
value of HQ* , which Is ihc ratio of tt- 
pnncr in quencher fluorescent cmU 
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mM Mg 

RG\jrs 3 fcUf*«i *f \A&** tt>i\<xr\it^M\ on XCt ratio Iter the M iertc« ol prol>««. Th* fluumuwH* 
ctrdsstoa InUncUy ai &1H and SS2 nm was mcWUrcO lor soJuthJcta omulninj 50 nw proU, 10 iTW 
Tdi-KCI (p*i M)# 50 mu KC!. (tnrt varyirtc unouais <0UQ inwl of MfftV'lr* r»i<viat«i ur* 
/nil<?s nnft intvntliy clMfigo ny :%Ji2 nm liiMTwHr) an- i>W»llril yj. MgOa concentration (ihm 
Kf rt ). 'Hi* L*<y (up]M *ighl\ i»nm» tlitr fMUttiu^ vuiuWinl. 



dyes uoed, nF»a*io$ be.tr/ecn reporter »r%d 
quencher dycii nucleotide sequent 
contcxl cJfccta, presence ol * Uucturc or 
ulhct fttCtw.i iKoi reduce rUjdUUiy of 
Lhc aUgonuclcoUdc, putKy of lh* 
probe. The JGCtmd facior 1* Ihv.tfKctr.ncy 

* uf brbtnUzaboti, which depend.** on 
pcoW T^f procne* of lecoiirtiirjr 3\ruc* 
turc in probe or tomjUie, armcahng 

• icmpcwiuK, and other reaction c&ndi- 
Hons. The t'lirrt foclo;* U the efficiency at 

' vvbkh Tflg DNA jwlyjrivru*v cfcfcvcs Ok* 
, bound orobc bclnccn lhc /epciAct and 
quencher dyes, This cleavage la depen- 
UciH on sequence coxnplcmcnOjrUy be- 
tween probe *nd tempJjte as jha>V!i l^y 

sefcnicm be ween icpona ati<l qvench«r 
dyeii (irubuca)ly reUucv: acuvaK?\if 
I«ciie. W) 

*lli< fi*c in RQ value* for the $©> 
Ties of probes scx.tti)( to indicate th^t 
rtc;;i?e of qurnchloc b rcUucvnj wmv- 
wait as ine quencner is pmcud loKVtird 

ing Is observed for piobc A1-1C 

5) ninet Than for the prob* where the 

TAMRA li a\ Vl* 3 1 «nd iTllS u 

■imdcratandflbk, »« the conCormMion of 
mc 3' ena posUJotj wtmw t>c exported to 
ba lees feurtctcfl Than (he canJonrwioit 
cf an IrUflmai ro*itu>n. In effetx a 
quencher at lh* S' «nd i* lr€€i to atlopt 
Ccn/omwOoni clo«e io the S* MportCf 
dye than b on intextiolly placwl 



pfol^, tlie JnterpftnadOH of RCt valued 
h lc«f vkur-ettl-.Thc prolicj *how the 
soracircod « Al, wJOi lhc 3' TAWIU 
piabc b%vin^ft Ur^«r AQ." Umn Uw la- 
UrnAj "JAWIiA ^obe. Por Lhe pflli, 
ht*h |»r«>bwt Kftv* Admit the sinir UCj" 
vdlu*. F^** lUc DS prob«, lhc RQ WUie 

probe la lc« |h»n fw llv* ini^noMy 
lADClcci probe*, Another (actOT tK*t mny 
wpUih iomc of the nhiterveul vartaKcn t( 
iha( pwify «ff«»ct.i tb» RQ" value. AU 
OiCiU^h nil jjrobo *tx Hl'l.C puWf;^ «t 
•moll omaurrt of con b nil notion with 
uxupumchad reporter can have n large el- 
fctl onKQ - 

Allbtiu|;b (licre iit*r be a mt>dv»l »J- 
fecl on degree o! qvieiKhlhjt, ihc por- 
tion uf tlic ^uwaljer <ip|«Kc/'tty uiu 
lmt« b Ufgc effcel on 1bc c*Ticienry of 
pivbc vk«vft£c» Tlir n\auk <3r<t*fcic effect i« 
ObSUm ; 0 wllh j?ro)ic Al>2, vrhe«: plft€C- 
mcnt nf \Ke TA^lU on lhc awoml itu- 
vlrutlUe tedLujes lhc efficiency of clefts 
egc to Almost *oro< for the a%, 1% and PS 
probr*, ARQ 'U mud* urcnlftr for Uu 3' 
TAM>tA prohrts a\ compaird *"!th the In- 
icrnAl TAMHA pmbti. Tn\s 15 explained 
niesi easily hy atisuinlup U*«i picaboi 
witM TAMRA^i the 3* end are more Mlcely 
to &c cJ«avwl beiwccj] icptnlc* 
quench «f than art pfOVics with TAMRA 
Dllachwi inkmally, V<ar the Al prober 
the dcavuffl flfficivney of probe AI-7 
must.aJrra^y qiihe high, ax AftQ dCCi 
noi mcT^Ksr when the quencher is 

Trt rh tt ^» pud. Th^ illua- 



trnlo the imporn»«rn nf holog Ahl(* to 
use probes Yflth a gutfichor on lb© X' 
did la «1C y nuclease r*CR away. In lhl« 
an JiVcrc«Bfc in tic mrcnMlry of ro- 
potior ft\»occsccncc is obwirved only 
w)iL'H Ui«f prwbc h cli-Bvvd Uvlwjl lliw 
leporlor and qucncbcj* dyxi &y oJncinK 
thu lojJiuLtjf •Mia qde/icUL'i dyut uj\ lha 
opt>o»Ue 4ftda of j.« r>I((vOOUClecUdfr 

prubc, *ny clo*v*«c iiul i^cctm wlJJ be 
dcuxicd. When die gucuclicr Is u\tuchv\1 

juubvr WLick* wdl (Al-7) 4nd4tlht:r lUtlw 
noi so vdi (A3-6). Tbt foUUv«ly taoii/ 
performance of pxebe pfc*umi»bly 
m»QC tho probe U boing cleaved 3' Vo 
the anchor to*h«t «^ batween 
rppnripr and quencher. TJ«refor<f. ihe 
bcit rliancc of havirii: s proho that reli- 
ably ddects accumulation oj PCH proii- 
uct in tb« V nuclease t'CK u&ay H to me 
u ptobp wirii rho rcpofle* ond quenehoi 
dye* uti opposite en<t«. 

PUcing ibe quencher dye on th« 3 1 
and may provide a SUtfhl boncflt L» 
iernis fs< hybrullxutl'Jn cf»i< ieacy. UlC 
prcscrjcr o quencher 4iwcbcd to an 
Internal nurieolldc hiJrJiI hr cxpczM tv 
dlkrupV Li***-jituriny ami reduce lhc T A( 
of a probe. In fcwHj b ruUiulitoi 
In hni been obacivcd fx ^^b<s 
Willi i/ltwxidlly ouadied TAMKA*.'** Hiij 
diimpii^ aff«d wool ci be minimked by 
placing the qucfKhci at U\c 3' end. Thui, 
prcibrsi >*ith H' qvcpvhcn rnighl ?>chlbU 
,\Ui;h^y higher hrbriclimtion e/nct«nt'l« 
Ut^n ]jiobeS wiOi IfjlertiaJ quciivWeii. 

Tho cftmhinatlon Gf increased d«v« 
age «>d hyi^rldUiUon tffieUnoe* oieatW 
rtlttt i>roi^ca rrilh 3' quencher probibly 
will be inoie loJcfditi oi mhmo\<h& bc- 
iwcn prol>e «nd i6Pget a a compared 
milt joU/i^lly Ubtlad piohat. Tl\u lob 
erancc of aiiom4l<h6* c*t\ li6 cdvAjHa- 
g«ou3, as Mfhcn trying \o u>c a 
probe to detect PCH-aniplificd product* 
faun nMi.|#l«K uftiilTciejAt SpeClCS. Al50, U 
mean's that <;le5vag# of probe thirl litf PCR 
Lj lew 5CJ#,*lHvr b> uHvcutiofU ill AiV. 

QC311B^ tcmpcraturr. or other lesieUoii 
coaAdiuonc. Th« on« application where 
tolcroucc of mlsrrvatch*fs may be .1 dlsad- 
vaaia^r ifl for allelie discrlmlnfttlon, I-ce 
ct ftl. <M fiemonjlratcd Ihsi alleW|»*c)£iC 
pror** wiitc clcftvcd brrweert rcponer 
and ipwnclicr only when hybrid Uca to s 
ptTfcctly complementary tar gel- This aJ- 
InvH'il them to dbr!n}jui*)i tlie norma* 
h\inian cysilc nbiwfa allele Irom thv 
AF50B mutant. Tbdr probe* had TAWA 
«ruchcd to inc seventh nucicoUdc from 
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F10UU63 Klfect of MS 61 wmtiimrrwlun on RQ. Wtto tor the. A] «erl« of prolw- Tha fhuMCWiue 
emiwiun whcttoV al MR aa4 ift2 nm was measured fur atiluUmw «m lairing SO nu probe. 10 mw 
Trtvlia (pH S.3X 5tf tnu KO, and vatyinfl crnouiO* (0 10 mwl «f 'MgO r ;)V cafculalCd RO 

Ms). The key («f/«*- *dr*f) «howi me probes examined. 



dyes tneo, ip&cm$ rjctwrrn reporter and 
quencher Uy«K. iwdfioilfle srnnjr.net, 
am text effects, presence ot itructute or 
olher uctors thai reduce flexibility of 
oUgoauciifOTirtc, and. purity of the 
ptobe. The somnd tartar IS IOC efficiency 
of hybrkdlMtioxii which depends on 
probe 7^ presence of accondacy struc- 
ture In prob* or template, armeollng 
lemper&bun, and other reaction condi- 
tions. The third factor is Th« efficiency ar 
wuicn yuq UNA porymcrwe cleave* tha 
bound probe beewcen the rcf orter and 
cwicnchor <iy«, itiis cleavA^ is depen- 
dent on sequeoce complfcmenuruy be- 
rwrcn probe and template » ill own ty 
the observation that mismatches in the 
tegm«nl between reporter and Quencher 
dyes drastically reduce Hu deavaue of 
probe™ 

The tbsr tn RQ values for \h« Al se- 
ries of probes seems to indicate that the 
d«fl*w* ol quenching is reduced sota*!- 
whet as the quencher ia placed toward 
rnc \ md, The lowest apparent quench- 
ing is observed for probe AVI 9 (sec Fig. 
3) »tUe/ than for the protse where the 
TaMKa is at the 3' end (M-26). TO* « 
•unacrsundHhlc, « inc confotmwion ol 
the i' end ponuon would be expected to 
be lea* rgstrlctod than the conformfltSo'n 
nf an internal position, in vtteX a 
quencher flr the 3' end U froor to wtopt 
eomormatjiins close lo the 5' reporter 
dye than Is an imernolJy placed 



pivlx-s, \hti liUurpreiaUnn ol RQ value* 
Is its* cicar-cui« Thi A* prnbes show tbc 
»mt iTcrid 'iii Al, wUh \h« 3* TA\4*A 
probe. lUViils a larger RQ" lh»m 
IWOttl TAMBA pruU*- the 1*2 i»«ur, 
butir piobpi hayfl about the same RQ 
Yalur. Tor the.I&prote , *l» RQ." *h* 
T.piotw U lew <htwi (vi tl»c mUuUlly 
labeled probe. Another factor that may 
cyrplain some of rfte obxervr:^ variation U 
that purity olteafi: the RQ" value. Al. 
though >11 prohc? are HPLC puWItert, a 
umaU amount "crintamiQRtion with 
viiiquwiched rcponer can liavr o ef 
fecioaRQ . 

AllboUjh there may be a modeit uf- 
feet on d«fjre? or quenching, the posi- 
tion of The quenrhrr fluiiatenUy can 
have p large effect oa the efficiency of 
probe cl envago, Tb* drwtlc C<fca Is 
observed with proh* Al-2, vuhero pi ore - 
ment oiJh^TAMRA OH the weond nu- 
, ci^^ri^iices the efflctency oJ cicav- 
C^c UvulihvU iCTOvj or th* A3, P2. and VS 
probw, ARQ u much gmur ror rhe .v 
T,\MiU proha,K compared with the in- 
ternal TAMKA piohes. This Is explained 
matt c«Uy: by awuming that probes 
wuh TAMRA at (i* 3' cn<l ita mote Illcety 
oe r cleaved bewwen reoortar and 
quencf\«;T iiun are prob« wtth T^RA 
ertftiffocd intern^ ly. for the Al prober 
the clcavoRt efficiency of probe A 1*7 
must already be quite-high* « A*tQ <to« 
not Increase when the quencher u 
placed closer to the 3* end- This Ulu«- 



iratas the bnpot\«uce ol boing able f;t 
ufle probes with a quenchec on th« 1' 
end in Ihc nucl«ic I 'Oil oaauy. In Uvl» 
away, an iner«a«e m tho mten»cty of r* 
fK»rf«f fluw»c*nc« la ohcirved <«nly 
when the probi U ci<avcd between mc 
TepurtcT »nd qucnebv* ^y«»* l *y l^^vlng 
tho wpotWft »n0 <iuer*chc? dyc» oa tha 
opposite *ndt of aji oUgotiucieoUdc 
pwlw, h«y cW«w»u« \Un\ OL'Uiri w(» 
di!t«aed. When the quench *r t* attached 
u» j«» LiLd'aiJ liuclqoddu, »uitjo()m^t lha 
prabe wArit< Wftll and other 

not *o wi«U CA3-^>, Th« relatively i^or 
performance of proha pretamably 
means die probe is beUm cleaved >' W 
th« ^wenchcr rather than br.tw«n the 
repcrtcf and <jwrrehvr- "''hertttora, Uie 
best ehanc* of h*vlr*fl * probe that cell- 
ably dcteets acvuuiulHt&uxi of PCiR proO- 
uci in the V nuclease PCM mi ay U totit« 
u jiroUft with *hp icpbrici and quencher 
dyca on opposite ends. 

PiocloK th« <jucnchcj dye on Ute 3' 
end rn ay olio provide a flight benefit in 
twmx M nybriOlzatlon clllcltncy. 'I'htt 
presence of a qunnchur artarltcd to an 
lnternut nuvWu*;de *n^hC txj expeded \a 
dXtrupt base-pairing An<) rcduc* the 7 m 
of j prooe. in fict a 2°C-3 f C redtict/on 
;<i 7',,, hu Uscrt observed for b«© proW 
*rith InKifMlIy attached TAMHAs, 0 ' Thb 
diArupuvc effect would lw. mlnimiscil by 
plJcingTMe quencher el the 'S' end. Thus, 
prober wiU^ V quenchers mluhl chblbli 
>Jl*htly higher hybrldisarton ttficicnclrs 
than probes with internal qu(incher.s. 

The tomUnaUnn of Increased deav. 
a^c and nyhrldiwtio^cfaclcocles rnaanti 
ihol pioh«* wWh 3' qt^nchers. probably 
will be more tolerant ot misraaictics be- 
nveen probe- and mg*it a* coaipaied 
with imcmaUy labeled probe* This wl. 
eranco of mismatches wn tit advttnta- 
gco»a), ti» wh*w trying to use a alngip 
probe to detect POUmpiifted pmrturt* 
from samples a( different species, aiso, 1* 
m*«iU mat dcavarc of pr»b<* during VOL 
Jfi lesR «en\ltivp w fllicrttlons In an- 
nealing t«mp^raturc or nrher reaction 
conditions. The one applicatton where 
tolttranee of rnJimarchct maybe a dUnd- 
v*nia« 1: for *JleHc UlsciimlnaUon. Uc 
et al.'^ demonstrated jhat allclf>speci«c 
prorjfs were cJcavtd between Tat>oner 
ami quencher only when hvbdiliwd t^ a 
iKV^cUy complementary tanr.ci. Thiv aU 
]pw«d them to Ulsiln^uhh the normal 
human cy*W< flb'osls ahtlc from the 
AttfJft rouhipt, IIuit probes had TAMRA 
jitiached to the seventh nucleotide (iom 
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,v «nd and w? M <4<>d$nod to thai any 
mlsmotcVio were bctvvcun tht roporlot 
di\d ifueAchcr. Increuinx the distant; 

Icaico I life dHfupUw effect of lUlK- 
CU^ha and allow cUavago of (he probe 
an the laconoct targyc, Thus, probes 
*Mh (i qUCi\Miof attached <o an internal 
itudeot)d« may si 111 bo ycoCul foi allelic 
Htsrrifninattan. 

Ul IMS study lott of quenching <u-pon 
hybridation wax used 10 ahem that 
quenching l>y a 3' TAV4HA i* dependent 
un ilw flc*lUUlty ot a sln^l»wand* CI oH- 
gunu&eutid* The increase iu Kporicr 
Jtuoidftcciu'f Intensity, Tfloujh, could 
^>oIm mad to dvttrmino wuounei: i|y- 
OfWlzAUon ixa$ occurrud or nor. Tnus. 
oVigoauclqolld« vyllh t*po?t«r and 
(JUCIKhcf dyes Attached at of#f»«.iJlc end* 
MiouUI also be useful 95 hybridization 
prol«*. Tho oWllty to Ocucl hybfldUcA- 
Ttun In real \lnie meant ihai iIksi* pru^us 
could be ustd ro Tncajuro hybrtdUatlcn 
Kinetics. Ato, ThK iy|ir nf probe could he 
used to develop Homogeneous hyhrul- 

^ration Pssoya toe dUga©f»4ie» or Mh»r »p- 
plications. Unwell Ct *l. (i0) describe just 
Oils rype of hamojienctms assay where 
hybridization of a probe Celiacs an In* 
tTuflia In fluotocoonce cau«cd by a I osa o( 
^Uftiwfclng. Mowcv«c, thty c til i sod a 
complex probe doign 'tcjtfhtr* add- 
ing iludctHUlCi to bo;h cndi (if the 
•pro bo iccjuimcu to foim wo imperfect, 
fuiirptfw. Th« wsuits prwontdd h«w 
dctiuumnu; chut the Mm pic addition of 
3 rqparLw eye u> one end of an oiigpnu* 
dcotldc ac<i & quenchtsr aye uic oiha 
<*'^fl g^norAtoc a fluomgemk profcw thai 
«»« detect hybstdtisaHi*! or I'CU amplifi- 
cation. 




U<H 1)y milUlnp, thr 1' *o S' «*«»micUr*M 
»e(Mfy of 'J>icrmi*i aqvatieus 1>MA p**y 
mciw*, *torA /ijorf. So. «8t ?3H 
JikC, 

M. l.yam<chr*. V- MAO. Rrnrw, onrl ].f« 

nucioaiytic dcivage o( nucleic atiUi br 
ouHftCltTiiJ i>na po(yme«ws t adentr 

Knor()!(rWjind«iniaE\Jod Fluoreuetiz. Ann. 

J*»nr{jiJfT «/ f1i*ciaceni tfiCCUaXCQpy. f 

^U. ii^. Planum P*w?,.W(iw York. NY. 

wansfwi A «p9CWw«opig wloi, ^mc /tou. 
Ae&J.!u\u RH;710-7i»#i. 

,iuwi tt/ thu nunun cyicipiaiirtic DcU-JCUm 

Hu in dj« lnt/ciA4. I'/.*. Vol J, 

^v. fta.cu^ on:. 

0. du BkuII, Piiui, -id HV.Mcu- 

dcW. lsjyj. Qu.ntU«Uun of (&.&ctin-*pr- 

Clflc 0)T{>JA tmnkVltntV Ultng MA 6 { Am 

Otdtlve POL i'Oft Mitho* X/V»Hf, 3: 57- 

. jy. . 

V, IAVUK, KJ, (urtWbl.)- 
10. »«w«U t CB« M.C Muiucon, tU» 

t«nsCA f aid t-). Urrcn, A r\fw |w- 

2423- 



Atjwiw J^, JPM; accept** In 
mixed form Mtfck 6, 



Wc •cfcnowl*d 6 * Unc«in McBrldc 0/ 
P«rfcJn-Lbne* f&t hh (o up port and *n- 
co ur agonal on Oils prujvii and MJich 
Wlnnik of the Uulversily ct Toronto for 
K*1 p Kj1 ciUcuicJoiu- on Urtio-tiAulvcO flu- 
ore;ccnco. 



I. 1-r.c, UC- C.H, ConneU, anc w. uiovii* 
loil on rca flu-ofoxcnic pfobfs, Nvr- 

1 11 ll... .t nil Ifc t\ *C. II Tl/„|- 
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f-jtVe have enhanced the polymeraao chain 
iWactiooa (PCR) such that specific DNA 
sequences con be detected without open- 
jug tSie reaction mhe. This enhancement 
requires the addition of ethidiuxn bromide 
£(EtBr) to & PCR. Since the fluorescence of 
KBr increases La the presence of doubie- 
|srnmded (da) DNA an increase in fluores- 
fceace in such a PGR indicates a positive 
Lmplificarion, which can be easily morri- 
cd externally. In fact, amplification can 
continuously monitored In order to, 
oilow its progress. The ability to simulta- 
-frieoiisiy amplify specific DNA sequences 
pond datect the product of the amplification 
l^bpth simplifies Axid improves JPGR and 
|iay facilitate ita automation and more 
| widespread use in the clinic or in other 
Kcuations requiring high sample through* 

tt hough the potential benefits of PCR 1 co clin- 
ical diagnoses arc weU known 2 **, it is -till not 
widely Hied in this setting, even though it i* 
four yearc ilnce thermostable, DNA poiymer- 
fe* nude JJCX pj-actieai. Some of the msom for in alow 
tccepca^ce are high cost, hc*c of autoioadon of pre- and 
~QWCR profiting seeps, and £ahc poslUve results From 
J^yovev-contaounation- Tne 6rst two poinu arc related 
ftfut lab err is the larg*et contributor 10 cr*st at ihc present 
age of PC* development. Most current aaaaya require : 
l^n form of /, dowfutrtani M pro ceasing once tfacrrabcy- 
'inj U done ^ order to determine whether the caiffcc 
wA. s*quenc« vaa projeot and has amplified. The5c 
ftcjud* UNA hybriHirafi'oa 3 '*', gel electrophoresis with or 
[rriihouc use of refltriction di^mon 7 * 0 . MP1C", or capillary 
V^Q-ophoresia 30 * Thcac methods ara hbor-intense. have 
throughput, and axe diffieult to automate. Thelhird 
•Pjiflr ia aJso doady related to dovnatreim processing. 
Handling of tKc PCR product in these downstream 
^caaci int,ic5\3C3 the ehtmcea that amplified .UNA will 
freao through chc typing lab, resulting in a risk of 




"carryover" false poarivaa in subcecjuenc lasting 11 . 

These downstream proccsennp; aups would be eHrni- 
nacert f f specific ampKficaaon mid dewerion of amplified 
DNA wot place simulaneovub wiikia an unopened re- 
action vessel. A53ays in which, such diffcrtrn procs&ces take 
place without che neerl to separate icacfJon components 
hava been termed ^homogeneous". No truly homoge- 
neous PCR assay bas been demonstraced to date, although 
progress towards this end has been repnned. GKchab, et 
developed a ]?CR product detection .tcHc-toc unnff 
fluorcaceeit pTim*r» that rtuuLt+d In a fluorescent PCR 
product AUcloapecific primera, *ae)i with. different fluo- 
rescent tags, were uaaa to Indicate the genotype of Che 
DNA- However, the unincorporated piim«r» must a till be 
removed in a downstream process **» wd«r to vkuaH2e tfie 
result. Recently, Holland, ct at 43 , d«v«toped an assay in 
which <he cadogenoxtt S* cxonudoa*» amy ot 'Taq UNA 
polymerase was exploited to dcn^Q a khded otigonucleo- 
cide probe. The probe would only <Icav« if PCS. ampliiv 
cation had prodiiced its complemcncary sequenoe. in 
ordjer to detea the cleavage productfli how^var, a subse- 
quern process is again needed. 

We have developed a truly homogeneour waiy for PCS. 
and yt JR produa deiecoon based upo» the gr««dy in- 
creased' AuoresCcjace thai crhldiuna bromide and other 
DNA binding dyes exhibit when they are bound so ds- 
DNA 1 * 4 ^ 6 . Afi outlined in Ji^ui*c I, a proic/rypic PCB. 



fPCRqrtcg ) 



/ 



(up ts WHMacQ 




tup <a liVtcri U$) 

l PrinApfe of (Imnlemoous ampUAcadoQ aad detection of 
FO R product. The components of a rfocoaiaimiyt ExBr thw^rc 
fluorescent arc listed — Etflr itself, EiBTboa«i«i to «th*x «DNa 6r 
daJ&NA. Tfcwa io a Ufy« Auorescenc« «nhanreio*nt wfcen ScPr 19 
hound to I>NA and btndme ia greatly enlianced vhen DNA is 
double-fltranded. After symdent (u) eyefe* of PCH, tKe net 
utorasc in d^DNA remln in addibonol £t3r biadicg, a net 
mcrcofl« in toul aaorteeence. 
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fKUtt a C*i «Lifiiwphofcai of PC* wflplification products of the 
human, nuclear E £»c. ULA made in the pi-cmee of 
increariiiK *mou«L o/ EcBr Mp CO ,wjAnl>. Th« Pry**™ £ 
EtBr haa no a Wu« effect on ihe jtcla or ipeancir? of ampUfl- 
eatioa. 




RCQtf » (AO fluorescence meajureracnu W PCSx ihai Concaln 



0.6 ^g/mi fctHr and to arc specific for r^nwtw^ 
imace* Fife rcpUcw rCOc w*re begun MUtum? each 
DKAT^ifieci. At «ch indtoud OUJ of the 

PCJU &r each DNA wu renwwed fro* -ihennocydta and ^ 
fiuora«r/i« measui^ Univ. of auorseence *«* b ^^<*> 

Si from a nale nWA and cortcrol mcmm wtHout any DNA, 
from <AJ. 



beau with primer* tha: aTe 3in$ia*«rari<l«i DNA (m- 
DNA), dNlfc, afld DNA PoWm^. An amount of 
dsDNA containing the target sequence (target DNA j 
al*> typically pweV This amount can 
oTcJlc^pli^on, from slGflc^cU mount* ot DMA 17 w 
microgram* per KX U . Jf Et*r if present, the reagents 
dwi will fluoresce, in order of iMciwfluorccc^oe. arc 
free EtBr iudf, and KtBr hound co the ^ff 1 ^^ 
£>NA primcrt andweh* d^uhle-stpnded target DN A (by 
its intercalation between the sacked bw«a of the. UNA 
douhWicfce). After the first tteRapradon cyde. urgei 
ON A wil] be Ur^ely ci*gie-srraBdcd. After a ?CR is 
completed the most ngmbcint change » £ 
the amounc of cfeDNA (the rCK product uselQ up ta 
fiaveral tntOTgrairw. Fonn«rlv free Mr »Wd » the 
additional dsDNA, TesutOns in an increase in. fiuore,. 
cence. Ther^ u abo some decrease in r ^^°^ v ° A 1 
ieDNA piinicr. but became the binding of EtBr to «DNA 
U much less than to dsDNA, the effect of thiscbwxge on 
xht total Kuorcscencc of the eiur.pie w *mali. The tluor€3- 
,;.ftce incrtaae can be measured by toccWMS exotanon 
iOuminidcri chroUfH die wails of the amplification vcwei 



before and after, or o^rn eondjiumisly during, ch^pmocy- 1 
cling*. 

RESULTS ' * ^» 

?CK In she pro»«nco of F.tHr. Txi order to wsc^a chc 
affecx 0^ I-iBr in PCR, amplifications of the huaie* HLa 
DOa gen^* 0 were perfoi*m«d with ihc dye prc»cn? ai 
conceaaarJoiu from 0-06 w S O ^/cdI U troical conc«x. 
trjdon of fVJBT ujed in t»i»iiaig of nucleic acids toUovuij 
gel cleco-opborcab ia 0.5 (J.^o0. As shown m n^re 3, ^ 
electrophorcna repealed little or no difference m the yield 
or qua&y Of the amplified* product vhethet EtBr v« 
absent or pTMcnt at any of tncfc conccnoraoons, md»^ 
inff char 5tBr doca not inhibit ?CB- 

Detection orf human Y-chrO£ttO30«niC 3pea£e 3 c- 
qaence*. Scaucaec-spoeific, fluorescence eahanccmcat of 
EtBr « a result of JPCR demonstrated in a xenea c f 
amplification* containing. 0.£ ^ml SiBr and prnnerj 
jp«Sc to repeat DNA sequences found on the human 
Y-chroraesome 1 ' 0 . These PCUa ioiciaJly contained ejeher 
60 np mala r 60 nf fcmdc S n K «ata huoiaa or no DNA. 
Five replicate PC^b were be^n for each DN A. Aficr 0 
17 21 24 29 cycles of thcrmocychng, a PCR wr each 
P>(A woo removed from ch« thermocydeT, and lis ftuor 
rracence measured in a spectroftuorocaeter and plotted 
vs. amplification cycle number (Fig. 3 A). The shap< of (Au 
curve reflect the iac£ that by the dwe an ifl 
fluorescent* can be detected, the increwe m £xNa u 
becomiPF linear and not exponent ^ath eyde number 
As shown, th* fludresaence increa3ed about ihre«-*old 
over the background Huorcsccncc for the PCKa cocuam. 
irw human malo DNA, but did not a*nin«itidy incre^B 
for negative control VCRs, which conrancd eulwfM 
DNA or huoiAn female I3NA- The mere nxftle UNA 
pment CO befiia with-S0 og versus 2 n&-thc fc^r 
cyclea were n«eded (SO give a detixoWc inexcaw ^ 
reTcence. Cxel ctcctrx.phoresij cn the products of ih«t 
wnpliheartons aho^d th«» DNA frapnew 3 of the «• 
^eKSe vcre mAcle in the male DMA c,^ S 
ieacdoni and that lurfc DNA iyncicds took place in tb« 

^iHc^n^die Incrc^e in fluorescence was vuuelijtd 
by jimply laying the completed, unopened PCRs oq a U ^ 
Ku^unacor and obotc^prine cbenx thmugh a red 
fifcr. This b shown b tore SB &r to 
beran with 2 off i*ale DKA and choae with no l^KA- 

Penc In order £ demotmrate tha thu approach h« 
Ideate sp«cfidcy to allO^ generic a ^«^ oin 

4 show, the fiuore*emce from «mpU^d ampUficaUons 
ContaininK EtBr (0.5 |i«m1) as detected by photo mphT 

teacuona rc« perform^ usln? pnmcrs «pt**/ «or * 
ther the wUd-type «• nc3tle-c<U mutauon cf th6 humw 
Sbta ^nc^. The C? «eifidrr far each allele is inW 
Sy rfadnff the eicWoiniaailoci sice at the "^^L?. 

of on. pr^ner. By U5^ an *WW«?^ r 
annealing tcmpftrature, pnmcT <?* n ™»~^ d .7^f^ 
pUncaaoL^m «ke plaic only If the 3' ^«°^ B 1^ 

P V.achpair^ a rapUtodonishownu,n^ 
a reaction with cither the vU^-cypc eJlcU^apeaaC 
tube) or slcWc^iJcle sp^ (ngHt Wbc) piwiert. ™^ 
oXUt W ^xre typed: DNA from a ^n^£. 
wiid-tvpe ft-ciobin individual (A-^J; from a hetctoiygo^ : 

DKA w!tart eacH PCR] w« analyaed m crtpta* (* ?^ 3 ' 
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of rcssuoiu e^c^)- The DNA type vas reflected in the 
relativa fluorescence intenudes in each pair of complete*! 
•mplincadons. There wai a flignificmt lncrease In fluorcj- 
teiice only where a 0-gtobia Jleie PNA matched the 
prime? $«• V*Tiea measured on s «pecircflUorarncccT 
Mata not jbowtOi this fluorewenen was about three time* 
tbac present "> a VCB. whera both p-glcbin aOeles were 
^marched to the prizn«r **t. C«l electrophoresis (not 
8 KawrO «tabli*h«d that this in crease in fluorescence was 
due to cbc ryiuhcjia of nearly * microgram of a .DNA 
■fra|pJiejH of the expect rd tis« tor p-globin. There waa 
litde synthesis of daI>NA in reactions, In which the aticle- 
jpcaftc prujaer w^a mismatchad to bath alleles; 

tam&uous 0(micoring of a PGSL Using a fllacr Optic 
device, it Is possible 10 direct excitation mujainacion from 
\ tpeeanfluoro meter to d. PGR und«rgoing th*nuocycUa£ 
and co return its fluorescence to ihe tpocxrofluoreincter. 
Xhc Huoresccnce readout of aueh an arrangement, rH- 
.rtcwc at an E*Br-ccncaininc Mnpliijeadon of Y-chromo. 
jooi« specific sequences from 25 ue; of human male DN A, 
is sho^n m Fiyurc 5. The readout from « control PCR 
Vith no target DNA la also shown. Thirty cycles of PCK 
Vwerc menitorad for each. 

1. The fluorescence trace as a function of time dearly 
■jho^a the effect oi toe ttiennocydlng. rjnoraccnee inten- 
jicy rise* and rills inversely vtih temperature. The fluo- 
reacencc interuicy « minimum at the dena^ran on tem- 
perature maximum at the oimcvttng/cxtoctfioo 
icmper*ture (5CTC). In the negatWocoviffoI $011, camc 
flu9'£3cenc« majanaa md minima do not change jyrniS- 
'tandy over the thirc/ themaocydes. indlcadfl£ thai there is 
&tk daDNA synthflflia without the approrpriace tarcct 
DNA, and there is little if any bleaching of ,W»r dunnf 
rhc continuous Uluminaaoti of tne iampie. 
. "In the PGR containing male DNA, the fluorescence 
jnbama At the anneafoft'exiension temperaiuxe Begin to 
;tfV3 , easc at abuut 4000 seconds of thcrmoeychBg, and 
continue to increase vicb ciraa, tcdienan j chat dsT»NA Ls 
being producer! at a detectable lc*cl. Now chat ihe riuor 
roicence minima at. the denaw radon temperature do not 
/tignlnontly increase, presumably because at thix c*mper- 
'ature there is no dsWA roT EcBr co bind. Thus the cour*e 
of the amplification <s followed by uncJunK the fUore*- 
jMncc increase at the annealing temperature. Analysis" of 
■,tfc« product* of thes« two amplifications by gel clcctropho- 
fesiE show«d a DW A. fragment of the expected aire for the 
irnal* DMA eorttainong saaple and CO dctcctaWc DNA 
^tbefis for the control cample. 

^iSCTJSSION 

Doi^tiatreana proc«sae» such a£ hybrldliadcm to i 5c- 
ifluencc^pceific probe can enhance the specificity Of DNA 
.tUttction by PGIL Tha ali&unadon ol these ptocesse? 
^»£ftna that the apecificity of this homogeneous assay 
;acpejioj aolcly on thac of JPCft- la th« ewe of s!dde*cdJ 
|.«i3CBie, wc ha*w ihovn th*t PGR alo&a has suflSdent ONA 
^uence apccificicy to permit ^eneti< scxeenmf. Using 
appropriate atripKficadan candidons, there is little H0O- 
j*?yoftc produtidon of daDNA in tha absence of me 
J^Ppropriatc target allele. 

r: The specificity required to detect pathogens can be: 
^'tc or l tt5 ^tjjjj l ^ rcquiT^J t0 genetic ccr'eening. 
I; Ypendln^ on the number of padao^ena in the sampk and 
7^ SfliCUnt of Other DNA that ra^st be taken with 'th* - 
^H-Ple. A difficult target is H1V J which requires detection . 
Pi 2 viral g^ncrae thai can be ac the level of a few copiet 
,fcr theuaandi of host cells*. Cumpared *nth ffao«tic 
s^wniug, which is pcrfonarri nn celh eontainin? at leaii 
i.fle.copy of the cargcz sequence, HIV detection reojuinw 
1 niore specinSry and the input of more total 
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flCtflU 4 UV phougnphy of PCR tubed ecntatnine ;m»U£ebrionrf 
isiihg EtBr that are spe66c oo vild-^vc (A) or sicJcIa (S> &U4l«s of 
the human 0-ttlobin gene. The left cf «ch pair of tubes contains 
allelc^pcclnc. primera oo the wUo\-«ypc iiHetcs. the right cube 
primen to th« ««iJe ailclc. The pha Wfrapta ukca liiar 30 
c^icJ*t of fCK and the Input BNAs aud the sll<de« thr/ 
are mdleaici Fifty of DNA «u used to bedn P< Typidg 
; wa> done m inp^atv (1 pun of FCKJ) for each tnpm PNA. 
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ItStfU 5 ConoTuaoui, real-time monitormjtof a PCR. A ftb«r opoe 

va< ue«d tc carry cXCttaU0U Hghl co « PCfe in f*rcgr«*i and JsO 
amitced light hack co a fluoromcter [mc Ejcp4mnentR( Protocol), 
y^mplihcadoc uiing huaan reakJ)NA spca^C primcTi in a PGR 
starting «n«h 20 ng of buaaan male DNA or in a control 
?ClT*«thouc DNA (botxora). wen: mamtowd. Thirty cycles of 
PHR were followed fui coch- Th« t»mp»rirurc cyded hctwycn 
94'C (dcuacuration} «nd SO'C (aftoeahntr and extension). Note in 
the nwJ« DNA PCR, the cycle (time) <3cpcti<aU»* »W«oje as 
fluotflitcence at t*ie anncaJin^^fteejiott t<mp«raiufe. 
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r»N/V — up » microgram amoimts— 4n order to kavc <uf- 
ncietlt numbers of carg«< sequences. This loryc amount of 
starring DNA in an amplification sipiiilauHly Increases 
the background Auorcccoace over which any addiaonai 
auorcaxuee produced by MX muse be .detected- An 
additional complication that occurs vjth targets ia low 
copy- a umber b the formation of the ^primer-dim^r** 
artifact. This i: the result of the extension of ana primer 
using rhe other primor as a template. Although ch U occurs 
infrequently, once it occur* the extension product U a 
substpacc For PCR amplification, and can coznpec« with 
true PCR oorjcM if thoj» targets arc rare. The primvr- 
ctfmcr product U of course dslJNA and thus ia a potential 
eouroe of fake signal in this homogeneous away. 

i o increase PC* specificity and reduce the effect of 
primar-dimer amplification, voc are invesdgaring a num- 
ber of approaches, indudinsf th* ua« of nested-primer 
amplification* chat cake place in a angle cube* and the 
^hoi-aurt", in which nonspecific amp lib cation Is reduced 
by rawing ttw temperature of tha reaction before DNA 
rynth«U begin*™ Preliminary results using these ap- 
proaches suggest thai primer-dimcr !a tScaMy reduced 
wid w u possible to detect ihc incrow* in Kttr fluprct- : 
cent* ia a PCR iwiiaatcd by a wigie H2V genome In a 
background of 10 s cclli. Witt larycr numbers .of celts , the 
background fluorescence concributcd by g-eaonur UNA 
becomes problematic. To reduce this back*rcAi»d, it may 
be poaibie to uac acc^ucnce-spedfic PN Y A-bindiaff dyes 
thai C20 be made to profcr^ndilly bind PCR product ever 
genomic DNA by incorporating rhc dye-binding. DN A 
sequence into the PCR product enrougb a 5' "add^oa" to 
Che Oligonucleotide primer 1 *. 

Wc have shown xh« ifce rleoecrJon of auor^actdoe 
generated by an EtBr-coneuning PCR b straightforward, 
DOLb once FOR ij completed and condnuowly during 
thermocydin^. The ease ^ith which automaton, of sp*. 
cific DNA deletion can be accomplished b the mo« 
promise; aspect of this amy. The fluorescence analysis 
of completed PCRi is already possible virb existing inaeru- 
mencaaon in $0*wcU formar^. In thi* forma*, the fluorea- 
c«ncc in each PCR can be c^uandtaud before, after, aad 
«ven ut selected points during ih*«nocyciirig by moving 
th« melt of PCRs co a 96-microweJi pSatc auoroceacc 
reader**. 

Th«i wtftruraentation necessary to continuous rymcmitor 
m^lopie PCK* ttmulraacoualy ia also wimple to principle. 
A direct oKwnticai of tJjc app&raeui u*cd here 15 to have 
(nulupW fib«ropeicS rrarureit the *xctarion. tight and fiu- 
OTeac^at emitfiaru to aad from muLoplq VCks. The abiiry 
to monitor muluple ?CRs coudftuouslv may a«ow quafc- 
cjtaQon of target UNA copy number. Figure 3 shows that 
die Ur jor the amount of iiaxan y tareec DMA the sooacr' 
duruig PCR a Ituorescencc incre&sa » detected. Prelimi- 
nary experiments (Hi judbl and Dolling-n manuscript m 
prepiiradon) ivith cononuous monitonxiy hav$ shown a 
sciuuivicy to cwo-foW difTeTCncca in iaioal target DNA 
corwxntradon. 

Coa*cT3cly, if e3tm number of cermet molecules ii 
itno^n-^3 it ean be in genede scree nin^-concinuous 
monitoring; m*y provide a means of detecting false per!* 
tivc juid fehc ncs^e renjie. With a known number of 
ars?t molecules, a true positive would exhibit detectable 
aaoresrmice by a predictable number of eycJ« of VOL 
Increases va ftuoxcacenca detected before or after that 
cyctc wuuld indicate pot«nual arafacta. False n«gadve 
results due to, for rampU, iahihidon of DKA polymer- 
ase, may be deceaed by including i*dth!n each PCU vs. 
inefficiently ampUfyine xntLrktr. This marker results ina 
fluoreicence increase only after a targe number of cy- 
cl«a — many more than arc necessary co deoea a true 



positive. If a eample fiuls to have n fluorescence incrc- je 
after this many cycles, inhibition may be suapectcd. GinL 

w$ assay, cwnduriopfi ore drawa based on che preservci' 
or absence of fluoieaeenco ttosiai alone, such conxrois 
be important any «vcnt» betorc any teat bae«d on cbu 
principle u ready £br th« dintc, ail aasessmcnt of m faUc 
posWvc/kbc negative r*tfis wifneed to be ohtninod uni*,, 
a large number of Xaown «amplcs. * 

la summary, me inclurion la PCK cf dyss >*hoec au 0 - 
rcjccnce is cnHancod upon bmdiivff daDNA maltes « 
possible w detect *pecj£c DNA ampllflcxdon from ouc3id c 
the PCR tube. In Uae future, uistrumencs based upon this • 
priacipte may facilitate th« more vrtdesprcad iue of ?C£ 
u» appjicadonf that damaod the: high throughput 0 f 
sampler. 

SXPERlifENTAl. >WTOCOL 
Huanan KLA-DQa i;cac tm^tiftcatioitf cooulnlne K<fin 

pH 6^: 50 mM KC1: * mtv( oniu of Tec DN^' 

v»lymcnM« (Perkin-Zlmer Ceius, NorWk. CT): ptnoic cacd ' 
of human HLA-DQa gene ip*ca« aligonuclcctiUv.- prlom. 
WfR« and OKS? 43 »a appw^mately 1CT co pi «a of Z>Qa Ka, 
-jttvUuci dfkted from a pwvicua t<KK.tion. Edudhiin broaud* 
p*n Sigaa> vas used ai Ac coaewitraoaus ladicatcd in Fiui c'' 
S^ lbermocycflnff proocedsd for ho cycles ia 4 modd 490' 



Y-chromosome «&cei£o PCK. PCRs (LOO ui wed twedoe? 
volume) ujittaiAug O.S H-^tDt litfir ^ere vnpantd u dwftiW^ 
W HLA-DQa, accept with different primort and UrgCL nNAS.*! 
Thrta PC3R* contained 15 pnaoU «irh male ).")NA-5pcafic primal 
ill and YI .r"*'', und eithor 60 ng male, 60 ng female, 2 n< 1 o»lo S 
or 00 hum*a 1?NA. Th«r«ocyd«5 ^> W^Cibe 1 min. and *>on^ 
for t mm oslnj >**rtep<yck" pjoyrwii, Th* prober O^CfdcS for? 
a cunpie vcre 29 «Aiic*«cd in figure 3. KiuorMCcnce mcaaure-, 
tnent 15 described beiow. ; v 
V^ e ! c "*P < * c * fic ' fc™ 48 * K?obua ecne Amplificationj of- 
Vt:j£ i' oluj ^ c ti^i ^gArl of E ( er verc prepared a/ 
**" - " :c*p?urtih differcw pruaers nns 



described tor HLA-DQa above 

tareet DNas. THoe PCP* *soata(nc& eiditi prinwr pair" HCWf 
H0J4A (>nld-cype gbbin specific ptimem} or HCP2/Hp (jfcfc* 
'frgwoin «peeif»c primers^ at, 10 pmol« wach p^mer per 



alfTwnt* 



There primcn Were developed by Vu ct al tl . Tluec 
target DNAs uaed in neparate amplii'icaoonfr-^O eg «di 
huitiAn WA that w horaozypaua for the aiclda «rau (jSj, CMA ; 
Jh« wa« hc!£rorys0U5 for Ac sicfcla trait (AS), or DNA tli« W-' 
horcoiygoiis (bT the va. gloVm Tfcormacycln^ vox for JOi 
cycle* « 9*x for 3 sun, «ad 6fi"G fior i min. uanz a "atcp^rdc^ 
program. An ann«aling lempcrature of i5*C W b«cs ihowa br ; 
Wu et ol. n to pravwie altcJc-specific ajnpt&atlon. Complntefe 
PCR* wtrc p>iOEi>entphcd through a red niter fWrauen. 
atter placing U« rcaedon tubes atop a model TM-S6 trana illume* 
nator (UV-pr<Kju C M s»a Gabriel, CA>. - 

Hhai «Ao«&£c measurement Fluarsscrrurc ffieasurcmcaefl ^^tw 
mad* on PCKa cix.caioinj Ktflr m a rluoroirff' 2 fltoromctcr^|j| 
JMT£X f Edbcn, NJ), CxAudon -*a> *t the SCO am wr.d wldj, 
about 2 nm band^th with & GG <S6 am cut-Otf ftUcr {Wail* 
Crw, Inc., Irvtac, Ca> co c*dtf/le seconcVordcr light liriati 3 " 
light was detected at 570 am vah abanoWth of*bom 7 nm. Ai 
OG 330 tun cutoff filar w»5 used w reooy« tbe exoutften Sghi 
CAadjautma ^norocewx moaitozute of PCR, CoHWUO^ 
aotiiiorinj of a FCK. in proprtoe was accoinpCaKcd u»ing ,v 
spearofluororactcr and cetongs dcsciibcU nber* m wdl a*;, 
noexuptic ocqctsory (S-vr.x car, no. 1960) to both read eNriwb?.' 
Keht 10 , and receive cxnlrood Uytu from, a i'CK placed in 1 ^ J 
a model 480 Cbermi>c7idffr (Perkio-l^nicr Ccius). TH« probe cJ 
of tlie fiberoptic oahle was attached w\iU **S QiinuW<pDjcv" iff 1 
u^oi cop of a PCR a/he (a O.S ml polypropykoe cenuiifue^ * u i 
Vith itt cap removed) ci£ccir*fllY s«ifm| it The expofea" ^P* 
dus PCR tube and the «ad ol chc fibcropuc coble vexe slicW', 
from room tig-be and the room Ughw were kepi dimmtf(i durirtj 
each nm. Tke rtroniioTed PCB- ^aa an asupimaaon of Mi*T 
mc«cD3*.«peci^c rcpc»t 3eq«iAces as descrlbcg: ato**,' eacca 
•uxiag an. ajtme»llo^cxi*n«ioo l emr era aire of.SO'C. Th* rcacuo 
*03 curcred -with mincra.t oil CI! drvpj) to prexaw e^apyr«^ 
TtcTmocydiag and Auorcjceaec nxasurement were «art*^r 
muUanAouily. A dmc>Uase ?c*n with a 1*» iecond inw/nredftn 
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Mi <u»«d and xhe amUilon si gust wis nuloed to the cxduuico 
sif^al to contra! fbr rhanccs In H$fu-«tirec iaicruicr. Data w*r« 
eS U*acd uainf dvc doiiOQOC vcrtio.fi 2.5 (SWCX) <£»a system. 

\v« tfianfc Bob jojia for Kelp **it& Bp*ctxo£\io*matn<: 
/jjcasuran^coM tnd Heath* rb«n To*»g tar adlOng rbts manUSCript- 

t, M«0ij. K., Fftloetw, F.. StjWC S.. i<uV, A., Horn. C. a*d Erlick H. 
ICotj. Specific cn/yffillic aoiplifitailan Zj^A in vfcrrj The palyiii«- 
lie dufii rc^oiwi.. CShSQO «liS81-5?7S. 

t whte. T.J.. Aithcim. N. and ErircA. tt a; im TOc polymerase 

Ciwln reacaon. Ticuda Genet 3tLB6-i9ft. 

3 . Rrlich. P. X. Cidfawi. n. ajii SqtDSky. J.J. 'Ml. ftcrcm, &<U«w» ( n 

4, Sufc* IC CcU'UWl. I J. H« WOf*. 1, Scftaif, 3. J, HiffwcU R., 
Hom. C. r„ Mulli». K.J3, aa<J £.Jick, If a. ie»5. >nmer*direCtcd 
enryiftioc anolifioiiott of DMA *ufc 3 chcnnoJ&bte UNA polymer* 
vr. 3dcncc 339;iQ7wi9i. 

j £*ilu. R. It.. Wakh. P.S., Levenmn, C, a 9ftd £rf«tft* «. A. 199* 
Ceueflc inatrcij of amplified DNA viiii Itnrnolj^rcsi *»uoii**-»tt«ofie 
uligonufUdu^ probe*. Proa Nad. Acad. Sd USA d&5230-b'm. 

a Kwph. 5, Mao*., IJ, K, Mimic K. B» rul«z, B.J.. SMicI* C. D., 
Clnir. 0 and Fo«<im»a-a*cn. A.ff. Ivfi7, X<icna£ca£'cn of' tlQixun 
immudodcHacnq virus tequcoGCJ By ucing tr, utpv tnqrrondc nu&pU- 
QCiioou tnU oS«o,acr «l«iw»je d«i*aion. J- v "ircL Sl:l B90-L694. 
Ciebah. F, V„ Ucjiiercy, M„ Cai. S. Xnw Y. W., CA~ycr. S. in4 

■ Rubin, £. M, 19S?. Detection of utklv e«U ucsva uwi OnLuicm f iL 

Muure im^^n 

; «, Hunt G. T.. Riohu^j, ft. oa4 ICKn^, K. V. 1080 AmpUSaLGa of a 
. K) jMy palvcnarphsc VNTH scamcBI br CM: pOljrtneraiC cruia nstcdan. 

Nut. A\Jds Ea. l«--il40, 
<>. Cau. E D md Dt-ng. M. W. )590, Rapid STiatyslB ud purlflovos of 

pcb*m:rasc ctuiu nuahm pn/vUcU by hi$kr?fjfo rm*n« liquid «tuo- 
. ti-togrjpKy. S:«uehruquea Bc34C-«55S. 

(0. Hocefv ul CctKu, fk. 3. and Pai ccn B- L. ifiSO. 6«p*Jr«u»t» •( 

re3ii vuh low nr.d IcTO CFOsiUokcd palyatrylvnWc y^ing c*i.C(v*««» 
. .al-.«J *ic«rii JUWi, J Chraattiogr. 5*l6:3J-^. 

II Ki'w. ii. r. and rtlorucM. KC. 1969. A^uUUuf *4»cr"«« wicfa 

jtqucnc-< br 8a«re«:caaa ampHScatiea: j cobr eumoicBJcatacion 
: a«». ?rot NatV Aao. 56. US\ Tft-«1 32. 
J 3. ftwflaod, f. Abmm*in, R. Wauon, E- »<i CeL£>Tui, D. R 



1 991. Qeuoroo of TpccSftc pfjtywccasc chain ««e*« prxi^i k« 
ubticinu ebc 5' «* 8* «b»(uiJ«m« activity oT TArraibr o^udfiatr £7N^A 
polymerase- Frac Natl, Aean. 3d. Uaa aansl^-TSOO, 
Mwkurits.7.. B. P. And U J. B. imfj. Etbiduaa di«~*: 

a new reaceiM for die flucsrimrtnc cictcrtfliaartOD of auddc aoda. 

lfi. JCap««ln.U, J. and Sicf. W, 1979. littff*Cflor« of r.O-Obmtt'me-e- 
|h«yUfliWl« with syailioSi. jw^nutf*£ocJ«fc N*«. R*«. 6»J£I9- 

1(1. 6eartc. M. i and Emorry. K.J. 1930* Ce^*«*«^pMA« mto**«a«a of 
U**4«itt 33S66 vitK the nuaor fiwc of an adOUDt-tr&a DNA 
duplw 3w<L*c\I ta sctuCtoa by *H NMti 1 poctmwftpjr. Nuc Adda Kcx. 

17, IX rt- ft., OyllcutcA. V. P., Cui. X F., fiuJo. 9- K. SrU'eh. H. JL aud 
Ai^obaa. N 198ft. AfflpUficatiOQ lAd 0i1'4>TlS of Dr<A jequotci ut 
skogic htftnw roc rr o ud dtpksid Mtt*#*«i I3tc«i l4 .a i,*? 

la. Abboit, M. A« Paw«, B.J., Byrne C Kwot i*. V., 6njotk>. J.J. 
aiulinicti. if. A. 1988, £nrrtnwlc gepe ^vUfl^uop, ^a<U»Bv 4 and 
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^•Tt\Q CO-14 motocuie !e expressad on the surface of 
>f7ionocyres and some rrRr.mphages. Membreno- 
?i bound CO-14 \s a receptor for apopoiysacctariae 
!l 5.ilPS) compla^Gd to LFS-Btndlng-Pnotefn {L3P) v Tne 
v - concantration of tcs sotuble fnrm is aJtered under 
' Certain pathologtceJ condttiona. There ie evidoncs for 
' :3P * iinportant role or sCD-14 with polytrauma. snpAis. 
,i';buminos ana inriammations. 
" Durinc, septic conditions end acute infocttons K seems 
^■■-fQ og a prognostic mai'ker and Is therefore of value in- 
•"i^onfToring mese patients. 

For mora information call or fat 



IBL offore an BJSA tor quanrrtatve determination of 
soluble CO-14 -in numan serum, -piaarm, ceil-ouinjro 
stjfwirn&tanti and other bioioglcai fluids. 
Assay featursc: 12 x S d^smnlnailon^ 
{mlrrntitBr stnp$) ( 
prccoated with a spocific 
monoclonal amibnriy. 
2x1 hnur incubetion, 
standard range: 3 - S6 ng/ ml 
dctaction limit ^ nQ/ml 
CV: intrar nnri intaraaaay < 8% 
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WISP genes are members of the connective tissue growth factor 
family that are up -regulated in Witt- 1 -transformed cells and 
aberrantly expressed in human colon tumors 
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Contributed by David BotSteux uttd Arnold J, Lewe, October 21, 199$ 

ABSTRACT Wot family members arc critical to many 
developmental processes, and components of tbe Wol s«cnal- 
•o2 pathway hare been linked to tumorieeticsEs in familial and 
sporadic colon carcinomas. Here we report the identification 
of tvro zcoes, WISP') and WISP-2 f thai arc up-rccuhtcd in the 
mouse raammary epithelial cell Une C57MG cransrormed tiy 
Wnt-i, but not fcy WuM. Towner wltn a totrd related tone. 
WTSP'3, these proteins define o su*>ftim"> ot the connective 
tissue growm factor family. T>o distinct systems aemon- 
Stratett wrsr Induction to he associated with the expression or 
Wat- J. Tticse included (0 CS7MG cells infected with flWnt4 
retroviral vector or expressing W»t*l under the canirol of * 
Utrflcytinfl reprecable promoter, And («) Wnt-1 transgenic 
mice. The WISP.1 genu ww localized to humon chromoflonte 
8q24.L-8q24.3. WJSP-I genomic DNA ww amplified in colon 
cancer celt linos and in human colon rumors and its UNA 
overexpxessod (2- to > 30-fold) in 84% of the tumors examined 
compared >nth patient-matched normal mucosa. WfSP-3 
mapped to chromosome 6q22-6q23 and also was or t rex- 
pressed (4- to > 40-fold) in 63% of the colon turavo analysed. 
In contrast, WlSP-2 mapped to human chromosome 20ul2- 
20ql3 and Its DNA was amplified, but KNA expression was 
reduced f,2- to >30-fbld) In 79% or me tumors, TBese results 
surest that the WISf genes may be downstream of Wm-l 
signaling and that acerran/ levels »r msp expression in colon 
cancer may play a role in colon rumnrigencsis. 



Wnc-1 k a member of an expanding family of cyjiemo-rich, 
gtycosyUtod signaline, pcolems thai modiato eiverso develop- 
menial proceases such 63 (he ccntrol of cell proliferation, 
adhesion, cell polarity, and the establishment of cell faces (1, 
2). Wnt-1 oricifiaHy was identified as an oncogene activated by 
tic insertion of mouse mammary tumor virus in virus-induced 
mammary frdenocarcinomas (3. 4). Although Wm-l is not 
expressed in The normal mammary fland, expression of Wnt-1 
in transgenic mice ciuses mammary tumors {5). 

In mammalian celts, wnr tamlly members initiate slgnaUag 
by binctine to the seven-fransmembrane spanning Friziled 
receptors and recruiting the cytoplasmic protein DUheveilea 
(DshJ CO t^lC ceU membrane (1, 2, 6). Dsn then inhiniu tha 
kinase activity or the norraaJly canstitudvely active glycogen 
rymhaxa Jcinacft-3jy (OSKJ^) resulting in an iccreice in 
js*eitenlc leveu. Stabilized ^xatenin intftracis the ttan- 
pcriptlon taclor TCF/Lcf 1, forming a complex rMt appsars in 



Tne puc nation com uf this uvtiz oefriiywi tn part vj charge 
payment Tfils irtlde moil merctore oc netcoy tnartcd "tivs/ttsiminc in 
«cortinn« with la VS.C §1 734 *ole!y to Indlats (hb fact 
C 1933 by Tan Nn^n-J A«*Jtmy of Sm«n«* 0Ca%342</03ya5l47l7.6t2.O3/O 
JTSa5 is 4vaivjj*t onunt aiwww.p^.oit. 



the nucleus and binds TCF/Lefl target DNA elements to 
actirate transcripUon (7, 6), Other experiments suehcsc tiUt 
the odenomatous potyposia coli (A PC) fumor suppressor gene 
atoo ptays an imponant role in Wnt signaline by rcgulaiuig 
p-catcj\tn levels (9). APC is phosphorylutcd by binds 
to ^-catonin, and facilkeces dcpredfaioit, Mutaliuns in 
dinar APC or fl-oalonin have been nssociatcd wjth colon 
carcinomas and melanomas, suggesting; these mutations con- 
tribute to the development of theso type;, of uanccr, implicatiAfi 
the Wnt pathway in tumorigencsia C\). 

Although much h&e bean learneo obout the Wnt sienaline 
pathway c^cr the past aoveraj years, only a few of the tran- 
scriptionally activated downstream coniponaua activate br 
Wnt have been characterizsd. Thote tlut have beori described 
cannot lccount Cor all of tb« divcrcc function* attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
tnose encoding tha aodal-roiated 3 gone, Xnr3 % a member of 
the transforming growth tactcr (TCil* >/i stmcrfamify, and the 
OOmSObox genes, engrailed, gcosecaid, n* t« (Vnv«), and siamois 
(2), A recent report also tfent»«es c-myc as a target gene of the 
Wnl Signaling pathway (10). 

To lOentiry additional downstream genes in the wnt signal- 
inn patfiway rim lit relevant to the transformed cell pheno- 
rype, we ascd a PCR-bascd cPNA subtraccon stwiegy, sup- 
pression subencttve hybridization (55 H) (M). mmg kna 
Lwiared from C17MO mouse mammajy epitneiial cell* and 
C57MG cells 5taWy transformed by a Wnt-l retrovirus. Over- 
expression of WiU-1 iii diia cell line is NufTicient cti induce a 

paruaJly transformed phenorype, charaacrlzcd by elongated 
<ind retractile cell? that lose CQntact inhibition and form i 
mulciltycrcd array (12, 13). Wc rciisoncd that genes differen- 
tially expressed between these two cell lines might tomribuie 
to cho transformed phenorypc. 

In this paper *c dcffcribc the ctonint: and characterization 
of two genes up-rejtulatcd in Wnt-1 transformed cells, W1SP-1 
and WTSP-2, anc a third related f enc, WISP J. The WISP jeno 
are mombcra of the CCN family of crgwth factors, which 
includes connective tisGuc growth factor (CTCiF), Cyr61, end 
nov, a family not previously hnkec to Wnt signaling, 

MATERIALS AND METHODS 

SSrL SSH w« performed bv using the PCR-Salect cDNA 
Subtraction ftit (CLONTECH). T^ter doublo-strondcd 

An»lTevlatlQrtS; TCF. trinstQcniing growth fuclcir: CTCF, connecijv* 

ciwuc t^v w vh factor; SSH, suppression subiractive nyonaitatton; 
VWC, von Willtbrand factor typo C mocigl*. 

Daia depot itiort; Th« wqueocw r«pod«d >n ihw pspor ha^o b«an 

acpositsO in the ^cabanV database (acccss»on nos. AH0y/T7 
AF100773. AF10O779, AP100780 «.d Ari007til). 
To ^hom ropnot ro^ywta »HouIg be addrewad. e-maiii diano@gcnc. 
com. ^ 
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cDNA synthesiied from 2 u£ of poly(A)" RNA isolated 
from the CS7MG/Wnl-J ccU line and driver cDNA from 2 ^ 
of poly(A) + RNA from the parent C37MO cells. The sub- 
tracted cDNA library was subcloned fo(0 a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clone* tncodlne tun-lcngth 
mouse WlSP-1 ~ere isolated by screening a A£tt0 mouac 
cmbfyo cDNA library (CLONTECH) with a 71 ubp probe from 
the original partial done S6S sequence correspond mg to amino 
adds 128-169. denies oncoding full-length human WISP-) 
were isolated by screening AgtlO lung and fcUl kidney cDNA 
Horaries with the same probe at low stringency. Clones eh- 
. coding full-length mouse and human were isolated by 

screening a C57MQ /Wnt-1 or human fetal long cDNA library 
wiih a probe corresponding to nucleotides 1463-1S12. Fuii- 
length CONA5 encoding WiSf-3 were cloned from human 
boae marrow and fetal kidney libraries. 

Expression oi Human *7$P RNA. PCR amplification of 
ftrsuarand cDNA was performed with human Multiple Tiasue 
cDNA panels (CLOK1HCK) and 300 uM of etch dNTP at 
Qa'C for 1 sec, 62°C for 3U sec, 72X for 1 min, for 22-32 cycles. 
WfSP and glyceraldahyda-i-phosphate dehydrogenase piimcr 
soqueacee arc available on request. 

jfn Sit* Hybridization. ^P-labelcd sense and antisensc ribo- 
probca wero transcribed from an K97-bp ?f :R producr corre- 
sponding to nucleotides 60 1-1440 at mouse W7.T/V or a 
294-bp FCR product corresponding to nucleotides 67.-371 of 
mouse WtSP-2. Ail tissues were processed as described <m>. 

Radiation Hybrid Mapping Genomic DNA from eacn 
hyrjflcl In the Stanford G3 and Genebridge4 Radiation Hybrid 
Panel* (Research Genetics, Huntsvflk AL) and human and 
hamster control DNAS were fCR-omplified, and the results 
were submitted 10 tne Stanford or Mass&chueerta Institute of 
Technology web servers. 

Call Lines, Tumors, and Mucosa Specimens. Tiwue speci- 
men* were cctaincd from the Department of Patholojiy (Uni- 
versity of Pittsburgh) lor paticnCS uttdcreoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (CJiagen) from Hie ponied blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human call lines: SW480, COI-0 WDM. HT-29, 
WiDr, and SW403 (colon sderiocarcinomas). 5Wo20 (lymph 
rtodc moueiasiifl, colon adenocarcinoma). KCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a vuiam of ATCC colon adenocarcinoma cell line LS 
\7dT). DNA concentration ^ae determined bv using HocchSl 
Oye 33258 intercalation f luorimetrv. Total RNA *as prepared 
by tjomogenizetian in 7 M GuSCN followed by centr irrigation 
over CsQ cushions or prepared by using RMAzol. 

Kent Amplification and RNA Expression Analyst, Relative 
girta ampliricaxJon and RNA expression of WlSPs and c-**yc in 
the cell lines, colorectal tumore, and normal mucosa were 
determined by quantitative fCR. Gcno-specifie primers and 
flu erogenic p rotes (sequences available on request) were 
designed and used to amplify and quanticatc the ftcncs. The 
relative gene copy numoer was derived by using the formula 
2tftci) where AC* represents the filfteren.ee in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphooyle DNA compared with colon tumor DNA or colon 
tumor RNA compared with norma' mucosal RNA- The 
i-mcihod was ueed for caJoulation oi the SE of <he ^CliC copy 
number or RNA expression lovel. The wyrMpecitiC signal was 
normalized to that of ihs grycaraldehyde-J-phOSpJWtc deny 
droacnasc housekeeping gene. All TaqMan assay reagents 
wcie obtained from Pcrkin-Elmcr .applied BioeySttms, 

RESULTS 

Isoladon or WSP-1 and WI$P-2 by SSIL To identity Wnl- 
l-lnduciblc genes, vvc used the technique of SSH u$ing the 
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mouse mammary epUhcJiaJ cell line C57MO and CS7MG eclb 
that stably eitprcss wnn (VI). Canaidatc differcruiaUy «- 
pressed cDNAe (l,3Sa total) were sequenced. Ttirry-nine 
pCKcnt of the sequence* matched known genes or homo- 
logucs. 32% matched expreaaed $ot;uencc tags, and 29% had 
no match. To confirm that the transcript wit differentially 
expressed, semiquantitative reverse crunacription-PCR and 
Northern analysis were performed by using ujRNA from ric 
C57MG and C57MG/Wnt-t cells. 

Two of the e-DNAs, M / Jl/ , -7 and W\$r-7. were differentially 
cxprcssod, being induced In the t>?MC>/Wnt-l Ce)l line, hut 
not in Che parcni C57MG celU or C57MO teOs overcxptesslng 
Wnt-'i (Fig, 1 A and £). Wnt-4, unlikb Wni-l t does not induce 
the morphologicel transformeuon of C57MG ceus and has no 
effect on jtf-catcnto levels (13, 14). Expression of WISP-) wae 
up-raguiated approximate^ Mold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately -Cold by both Northern 
analysis and reverse transcriptioruJ»CP. 

An indopondent, but similar, system was used 10 examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wm-l ecne under the ccnwol of a tetracycline- 
reprcssiblc promoter produec low traounts of Wnt-1 in tha 
repressed stale but show a strong induction of mRNA 
and protein witnta 24 hr after rciracyoiinc removal (8). The 
levols of Wut-1 and wiSP RNA Isolated from these cells at 
verioue. times after tetracycline removal were assessed by 
quantitative, PCR. Strong induction cf Wm-1 mKNA was Seen 
aa early as 10 hr after tetracycline removal. Induction Ot feW 
mRNA (2- to 6-fold) was seen at 4ft and 72 hr (data not shown). 
1 hese data suopon our previous ofcservationa that ahow that 
VS1SP induction Is correlaTCd wiih Wnt-1 wrcssion. Becauee 
tho induction is slow, occurring after approximately 4fl he, the 
induction of WtSPi may be sn indirect response to Wnt-1 
sJgaaliiu;. 

cDNA clones of humfto WISP-} *ere isolated and the 
sequence compared with mouse WISP- /.'The cDNA sequence;: 
of mouse and human mSP-l were 1,766 tnd 2,830 bp ia length, 
respacrivdy. anO encode proteins of 367 an, with predicted 
relative molecular masses of -40.000 (M r 40 K). Eoth have 
hydrophobic N-terminsit slgna' sequences, 38 conserved o>e- 
teine residues, and four potenlial N-llnVcd elycosylacion site5 
and are 84<^ identical (Fig, U). 

Full-length cDNA cion«5 of mouse and human were 
1 ,734 and 1.293 bp in leiujch, rcspectivjly, and encode proteins 
cf 251 iinti 7.10 aa, respectively, wich prodictod relative molec- 
•ular aiasces of ~?7.0fin (Mr 27 tC) (Fie 2iJ), Mouse and human 
WtSP-2 are U% identical. Human WlSP-2 has no potential 
N-Unked glycosylation sites, ami rnOUSC WlSP-2 has one t.l 








Fic- 1. WISP- 1 sod WISP-2 are in ductal b T Wnt-l r tut not VVnt-4, 
expression Ifl Q7MC «Hi. Northern enttlysia of (A) and 

(&) Morcwion m CSTMG, CS7MG;Wiit-l. snd (TWO/ 
Wnt-< cciU. Poly(A) + RNa (2 ^) ^« cuhjceied to Monhem oid 
uiaiyav and hybndheed with a 70-t>p moub* Wf-i-spcciflt u«obc 
(unuio aeldc 373 JU0) or s l<»*-nc- K7.Vf O-sticcilic yrohc (nuciooticlw 
U3fl-Ifi27) in the v uncrartilatea ceslon. eiou wcrdiybKduwdw.ih 
human i^-tctin probe. 
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FiC 2. Encoded am' no 3CU sequence alignmcDt of moue* and 
human WIS?-! {A} and mouse and human BftSP>i (£). Th« pote/ituJ 
cianat sequence Insuifn-like ero*th taccor-binding ptoiein pOF-BP). 
vwr, tiinoaibojpundm (TSP), and Curmmil (CT) domains arc 
underlined 

position 197. W1SP-3 has 28 cysteine residues that are con- 
served anion* the 33 cysteines found In W75f«i. 

Identification ot WISP-3* To search lor related proteins, wc 
screened expressed dequeues tag (EST) oatabasss with the 
WiSP-t protein sequence and identified several 
potentially related sequences. Wt identified a homologous 
protein ttiat we Uavc called WISF-3. A fuiUength uuman 
WISP-3 cDNA of 1,37 J. tjp was isolated corresponding to those 
ESTb that encode a i54-aa protein with a predicted molecular 
into! of 39,293. has two potendal N-linkcd /rfycoeyl- 

ation sitoa and 36 cysteine residues. An alignment of the three 
human WISP proteins showE that WISP -1 and WISP-3 arc the 
UlOSt Similar (42% identity), whereas WISP-2 has 37% identity 
with WJSM and 32^ identity with WISP-3 (tig. 34). 

w/SPr Are Homologous w> the CTCF Family of Proteins. 
Human WSP-L WfSP-2. and WSP-3 f.re novel sequences: 
however, mouse j£ inc sumc as the recently identified 

£lerJ gene. £/«J is expressed In low, but not high, metastatic 
mouse melanoma cells, arid suppresses chc in vivo growth and 
metastatic potential of KA73S mouse melanoma colls (15). 
Human end mouse WlSP-2 m homologous to the recently 
described rat gene, rCcp-J (16). Signihcant homoloor (36- 
was acco to tho CCN family of growth iactors. This family 
includes Chrec members, CTCF, CytSl, and the prctoonco- 
gene nov. CTGF Is a chemotactie and miiogenic lactor lor 
fibroblasts that (s implicated in wound healing and tiOrotic 
disorders snd is induced fly TCF-0 (17). CrrSl is «n extracel- 
lular matrix signaling molecule that oicmotes cell adhesion, 
proliferation, migration, ang«ogcnesi$. and tumor growth (IS, 
19). nov (nephroblastoma overexprcSSCd) nan immediate 
early eerie associated with qutsscence and found altered in 
Wilin* tuwiors (20). Tho proteins of (he CCN family share 
functional, but net sequence, similarity to Wn*-t All arc 
secreted, tystcine-rich heparin binding glycoproteins Chat as* 
codaie with the cell surfaco and extracellular matrix 

W&P proteins extllblt the modular architecture of the CCN 
family, charactered by tour conserved cysteine- rich domain t 
(Fig. IS) (21). 1 he M-tcrroinal domain, which includes the first 
12 eysteino residues* contains a consensus jequence (GCGC- 
CXXC) consorted in mote insulin-ltac rrowth factor (IGf)- 
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Fio, ?. (A) Encoder! amine acid «q\wtur» align meat of human 
WT57*. The calcine raoiduM of WISF-l and Wis*.;* mat axe not 
present in WISP-3 ato indicated with a dot. tiO Schematic rtprcscn- 
tarion of ihc WISP protainc xhowiog mcfofaalnsrrucwie and crstoino 
r esidu**» (v« rtical lloat) . The lour cySCMnc m'tduea ia (ho VWC domain 
tfwt are absent in W$PA ire indicated witli a dot. (C] Supfaasion of 
W7iP cnRNA in Quman a«ues TCR *as p*rform«d oa humaa 
multiple.aseuc cOTa paael* (CLOKTE.af) from the iaciicateo armu 
ami /cral tissues. 

binding proteins (BP). Thifi c«<juerica ts conserved in WISP-2 
and WISP-3, whereai WISP-1 has a glutatrtlnc in the third 
position Instead of a glycine. CTCF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the TGF-BP domain is oncogeoic (23V The von Wil- 
lebrand fcactor type C module (VWC), alao tound in certain 
coJlagetix and mucins. COVCK rhe ncjiE 10 cysteine residae*, and 
w thought' to .participate in pmtciu complex formation and 
olifiomeriiation (3^1). Th* VWC domain of WISP-3 differs 
from all CCN family mcmoers descdocd p/cvioualy, in that it 
contains only six of tha 10 cysteine residues (F1C- 3 A and B). 
A short variable tecion follows the v wc domain. The third 
module, die tniombospoodin (TSP) domain is involved In 
bincf'Ag to Sulfated glycoconiMgatiw and contains su; cysieine 
rcudues and a conserved WSxCSxxCG moiif firu identiritd in 
thromoospondin The C*icnninal (CT) module contain- 
ing the remaining 10 cysteines ia thought to be involved in 
dimerization and receptor btndlnc (2tf). The CT domain i« 
present in all CCN hmUy memDea described to date but i* 
absent in WLSP-2 (Fig. 3 A and B). Ttic existence of a putabve 
iignd sequence and the abtence of i transmcmbraac dosutin 
sucgest thet WISPe arc «ecrcted proteins, an observation 
supported by an analysii of their expression and reaction from 
mammalian cell and bac^lovinas culture* (data not sho*n). 

Expression of WISP raRNA in Humoo Ttfcusa. Tistue- 
specific expression of Human WlSPs *ns cheracterued by PCK 
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analysis on adult and retal multiple tissue cDNA panda. 
Wiw-j expression was seen In the adult heart, kidney, lung, 
pancreas, placenta, cvary, small intestine, and spleen (Ftfc*- 3Q- 
Little or no expression was detected in the brain, fiver, skeletal 
muscle, colon, peripheral Wood leukocytes, prostate, tesrls. or 
thymus, WISP-2 had a more restricted tiscue expression find 
was detected in adult skeletal muscle, colon, cvary, and fetal 
lun*. Predominant expression of WfSP-3 was seen in adult 
kidney and testis and fetal kidney- Lower levcle of WISP -3 
expression were detected in ploconla, ovary, pro&tato, Mid 
small intestine. 

In Sku Localization of WISP-I and W1$P*2. Expression of 
WISP-1 and W1SP-2 was awesaed by in sicu hybridization in 
mammary tumors from Wm-1 transgenic mice. SircriE expres- 
sion QtWfSf-J was observed in stromal fibroblasts bine within 
the HbrovasciUar tumor stroma (Fig. *A-0). However, low- 
level WSP-l expression aiso wes observed focaiiy within tumor 
cells (data not shown). .No expression wis observed in normal 
breast. Like WISPS t WISP- J expression also was seen In the 
rumor stroma in breast tumors from Wm-1 transgenic animals 
(Fig. < £-/V). However, WISPS expression in the stroma was 
in spindle-shaped celia adjacent io capiUax-y vessels, whereat 




FlO. 4, (X, C Jf. ano O) Representative hcrnaio*rlitt/cc3in-ataiccd 
Images from breast tumor* in Wm-l transonic mice The cor respond- 
ing ifcrk-ficid im*j03 3no*»iflg wcwMton wo shown in £ and 
D. Th* iumor is a modor*ioty woU-diuaranuated adenocarcinoma 
snowing 4videne« of adanoid cystic change. At law prjwcr(^ and ft). 
expression of WISPS is seen »n the delicate erancninc fibrovaiculaf 
tumor stroma (arrowhead). A' hitfier tnagntocattoji. expression is seen 
In rhe scrotnal(fc) fibroblasts (C «jiU O), *nd turner eella wc not«"ivo. 
Focal expression of W1SP-1, how&w, **oa obacr^od in tumor celk in 
some areas. Image* of nxprwsion a<» shown in £SI, At'low 
power {£ and P) t axpteaicn of W1SP-2 it teen in cells lying wihun did 
fibravivcutir tumar siroma. At Vgncr macniucaUon/Wcsc cells 
appearec to be acjaccnt to capillary vessels *hcrcas rumor cells arc 
negative (G and if). 



Froc. Natl Acad. Sci. USA 95 (1998) , fc . 

the pre dona in ant cell typo expressing WiSPS wae tho stromal 
fibroblasts- 

Chromosome Localization of (he WISP Gcnca. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panclc, W($PS is approximately 
W cR from me meiOUc marlcer aFM259xc5 [logarithm af 
ocos (tod) score 1 M i ] on chromosome 8q24, 1 to Sq.24.3. In the 
same region as the human locus of the novH fernily member 
(27) and roughly 4 Mbs distal to c-myc tl&). Preliminary fine 
mapping iudicatcs that WiSPS is located near D«S1 U2 STS. 
WJSPS is linked to the market SHGC-3.7922 (lod « 1,000) on 
ehrtmoioma'iiqU-aOqlJ.l. Human WISPS mapped tc chro- 
mosome 6q22-6q23 and ia linked to tho marker AFM211»5 
Hod - 1,000). WISP-l ia appra\inu«cly 13 Mos proximal to 
CTGF and 23 Mbfl proximal to the ham an oallular oncogene 
MYB (27i 29), 

v Amplification and Abcrran t Exp nesgloo of WISPs in Human 
Colon Tumors. Amplification of proiocncoftcnea is seen in 
many human tumors and haa cuolocic.ii and protnoatic ai(t- 
nrflcahoe. Pot example, in a variety of tumor types, c-rnvc 
amplification has oeen associarcd with malignant progrcaskm 
and poor prognosis, (?0). Because WISP-J rcsidcx in the same 
general chromosomai location (Aq>d) as c-/nyc, we asked 
whether it was a target oi gene amplication, and, if so, 
whacker th« ampiifieation u^ls indepen dent of the evnye locus. 
Genomic DNA from human colon cincer cell lines was 
assessed by quanttiativa PCR and Southern blot aruryiis. (Hg. 
SA and By Both methods detected similar degrees of WJSP-l 
amplification. Moat ocll linos ch<wd significant (2- to a-fold) 
amplification, with the HT-29 and WiOr cell lines demonstrat- 
inc an^S-fold increase. Siimifioondy, ih« puttarn of amplifica- 
tlua obacrved did net correlate with that uba«rv«d for c-my*, 
indicating that the c-myc £cnc is not pail of the omplioon thai 
involves die WISf*-l locus. 

We next examine a whether the Wlhf genes were amplified 
In a panel of Vi primary human colon adenocarcinomas. The 
relative wisp gene copy numner In each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitauve tfCR (fig. 6). The copy number of WfSP-t and 
WISP-2 significantly greater wan one. approximately 
2-fol<l £ox WISP-I in about 6U% of the tumors finf A- to a-rold 
tor WISP'S in W% of the tumors (P <: U-U01 lor each), the 
copy number for WISPS was in d is tin guishabk irom one (f = 
0,166). in addition, th« copy number WISP-J was signifi- 
candy higher than that of WISP-l (P < 0.001). 

The lev*l6 of WISP transcripts in RNA inotatod from 19 
adenocarcinonua and their matched normal mucosa were 




FlO. 5. Amplification of WJSP4 fl«nOcr«it: DNA *n colon cmocer cell 
linis (A) Amplification m cell line DNA was determined by quanli* 
tativc PCR. <fl) Southsm Wo« comamfn-j genomic DNA (in ut) 
oigcstec wi'ca^cfiRl (*H c /*«/) OfAnoI (c^nyc) ^erc hybridised* iih 

9 100-bp hunua fYISF-I yrjbc (ami:iO ftdds 16C-219) or a hatB.an 
c-myc probe (located at bp 1901-2000). T>»« MSP and m/c £4ndfl tit 
dotootod in normal crimen gartoinie DMA of lor a lofia*/ film «po*urc 
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Ftc, «S. Genomic amputation of WtiP ******** to'S 
* ™« W relative icuc eooy oumbor of the gcndS m p 

5ffi^^«W* amative 
nS* f«m Drlroary fcunua uncri with pOOICO DN* from 10 healthy 
n*ar, S ± sem from on* *>« - 

^pliL«. Th. experiment was reixa(ed «« Ic.tf thw« : 

a*e«=d By quantise PCR (Re. 7). The level of Wggj/ 
WA present in tumor tissue v-riod *V^w"£S£ 
mY»J«8d(.M0 >2M0W) In M* (18/19) of tho hurnancolot, 
S oimuttd compared with normal adjacent www 
o£ 19 wmors showed greater than VMM overe*pr«*c«_ 
in contrast, in 79% (15/19) of the 

»MA expression was sienif.cendy low« «o the « 
rnucoss. Similar n »!»?./. IHSM RNA *« ^•' ex P rette ^ 
63% (12/1U) or the colon rumors comparer* *ith tho normal 
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F«. r WISP exoresston in primojY human ooliw tumo« 
«Iw to exprtsrion in uocmd mucosa from the ^.P^ 
ufW mUNA in 10 adenocarcinoma 
v 1,,/nnr is liiied under w«- 



mi a nut alive rLiv in« uumj w * * — - - - . ., ■ 

K ^bS.«.. .« -ana .- 8RM ftom u« ^r.m«r 
doni in triplet*. Hit experiment wis repeated « (»«.( 



hie N**- Accd. So. USA f79*> . 

mueoa. The amount of overexprcssion of WISP-3 ranged from 
4- to >40-fold. 

DISCUSSION 

One apwoaeh to undemanding ihe.auDceuiu basis of «•« 
" „ jSentiTy diffwancac in gent expression between anew 
cdb and ntrrnrJ cells. Srrato> based on »™P*» *« 
Sady-Stacc mRHA lovle wTu differ Wt«*co 

crib have been oacdjo jtag^fc-^ 

SSSK-.££ mamn-ary epahcUel eelU U Wt £oro«d by 

W laree of the genes dieted. IWSM. W«M. 

a „ memoers M «C CCN family of growth 

indud« CfGF, CJTM . and rjov. » femdv not ptev,o«ly Unked 

To' SeHLc experimental demon.^ that 
B-raPlnduelion was ttiociaied wiift the cucrcjsion 01 wnt-i. 
S, TcS7MC; eells infeeted with » Wnt-1 reirov.ral 
vecto or CS7MG cell* exprcsSin£ W n t-1 under the control <J 
rtetrJoyli-e-ropresribW P™noler. aruJ the second ^, « 

whercaj notmal breast tiaeue dew not. No HW RJ*Ae*pres 
I^^™c Setcd in mammary ~mo.«. Induced by polyoma 
^rSs mtddU: T tntigon (data not «ho«n). Ihese toa suopat 

could^etult from wnf" sicnallns directly gc««« 
transcription factor regulation or altcnaurcly tUrough wnt i 
S2SS»8 wming on* txanserlprton facto,, wluch m tun, 

re Th:wSfc..»nadm«.onalSuWamilyofthef^ 

of erowth factors. One $ trikin g a.trcrcnce observed in the 

ESi ***** (3D. Hi «™P»"S 
Wf??-1 and WISP-3 way exist at dimars. whereas wisp.?. 
5Si?»0»W«r If the CT dcrmun is al»o unponanl tt.r 
Sor bE WSF-2 mey bind its ,ee«plor rtjrough a 

£ n|lon Of the moieeole the other C» ^ 
members. Mo specific receptor, have been ^^^CTCF 
or nov.A recent report has Shown th«t inie K nn « 
an idhe.iori receptor for Cyr61 (33). jrl/f .„,. „,.,., 

Tn* atron. exprejtlon of W1SM and H'/5/'-2 in eella h^ng 
w,d.;n tho fibrovajMlar tumor stroma in breaat tumors from 
wSa trMsUie «tf«»U * consistent «iih prev,o«« obatr- 
^linaT ^cri F « tor me related CTOF pne are pn- 
mX«pra««d in the tibrOUS SWMU Of manunary rumen 
<34VEXli»l eelU arc ibou 8 h, to Ofttrol the proI.ferat.on or 
[or n«? vc U«ue stroma in mimnui y lumor 3 by . eueode of 
SS-Ato or .i^naU dmfltr to thnt connoHine conn=et.« 

formation during "°und re,»ir H 085 been proposed 
rta^r^umof cells or inibSiwucory ceils ac tnc rumor 

St omi ptolifctation (34). TGF-fll « « rate d by i U«C 
MTttflUCS of malienent broast tum-n and rnay « one of ihc 
gTowrii flctors that stimultt-M the ptcdueoon or ClOh and 

T^oSc: that WISP-! and 
observed in the Stromal cd] 3 thai n.r.oundod the tumor eUlt 
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(epithelial ceils) In the Wm-1 transgenic mouse sections of 

btwt tittue. This finding <u eS cau ihtt 

could occur in *Kich the stromal cells could supply WIS IM and 

W1SP-2 to regular tumor ecu growth on the WISP 

Wtf matrix. Stromal calUcnvea factors in the extracellular 

matrix hove been postulated to play a f**"™^' 

Tnicrnuon and proliferate- (35). The localization of WESW 

ond tVlSP-2 in chc stromal cells of breast mmon supports this 

^ArTSal^of W5^-J gene amplification and expression 
human colon tumors ihowcd a correJaUon between Jf*A - 
ampliation and ovcrcxrreaaion, 

WJSM RNA w ftfl jeen in the absence of DNA ampliirauon. 
la comrvt, MSM DNA wca amplified in the eototf tumors 
but its mRNA expression was significantly reducad in the 
majanty of amors compared with the cxpreeaion in normal 
colonic mucosa from Cfte same pancnt. Tho gene for human 
WISP-2 w*S localized U) Ctiromosorac 20ql 2-20ql3, at & region 
frequently amplified antf associated with foci promos* in 
node negative breast cancer and many colon canccra.su^est- 
in C tha existence of one or mote oncoeencs at this locus 
(36-3S) ficcauce the ccmer of the >Oql3 arnpJ icon has not yol 
own identified, it is possible that the apparent amphfioauon 
observed for WJSP-2 may oe caused 0y another rcnc ui this 

'""A 'ra°eent manweript on <op-L Ac TJC orthotic of 
describee the loss of agression of this gene after ceil 
Uanrfonnntion, sugg^ting it may be a negative regu atm erf 
ecowth in cell lines (16). Although the mecnanism 0y which 
W-2 RNA expression b dowrvregulated dwngvii^ 
transformation is unknown, the reduced expression of w }*r-i 
in colon rumors and cell line* w( *aa that « may " 
a tumor suppressor. These reaulu *how that the WISP genes 
ara aberrantly expressed In colon cancer and sutt«« thai ihetr 
altered expression nuy confer selective growth advantage tc 

the tumor- . ■ .. 

Members of the Will si£nallne pathway been impli- 
cated in th £ pathosfines's or colon cancer, brcaat- cencor.- and 
melanoma, including tne tumor suppresses jene adenomatous 
polyposis coli and 0-catanin (39). MutHuons in specific regions 
of either gene can cause the stabilization and accumulation^ 
cvLotJlaamic fl-catanin ( which presumably contribute* to hu- 
man carcinosis through the activation of J^'j^* ^ 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorioencsifi >c un^OW the 
Idenafication of WISP9 « ?6nM that may be regulated down- 
stream of Wnt-l in C57MG celU tuggote they cou A M 
important mediator* of Wnt-1 ua ^ otm \ ao ^^^^' 
000 ana altered expression patterns of tho WISP* un 1 hwnw 
COion tumors may indicate an important role for those genes 
in tumor development 

Wc thank the DNAsynthaia ptow for oligonud»otidc 
Baker tor Technical asiiflau*. P. Do~d t*^™^* ^J*"^ 
K. Wflicrc and R. K«jo for ih* wc^auiblc CA7MO/ Wnt-l-OCHt V. 
Dtai for ei-cuesion., and D. Wood and A. «ruce ror artwork 
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We have dmloped a novel "real tmi«" quanOutUe PCR i^ctliod. The method measures l>CR product 
.irrumuuefon through a duaMabdctl fhiot ©ten te probe (Lc., TaqMan ProtMi). TMc method provide* very 
accurate and reproducible quantitation of gwe copies. Unlike other quamU^iivt PCR. methods, real-time PCR 
docs noc require pofl-PCR sample handling, preventing potential YCk product carry-over amamiiivinn and 
re&uMng in much faster and frtelw diroughpUt 'Assays. The neal-tlmc PCR method has a very larga dynamic 
ranfje of joining career moiocuia deUri^ln.itlbn foiU ilvc ovdkn of magnitude), taUlnw nuamUarlvc 
PCR Is tmremcfy acniracc and less Ubor-intehiWe than current fluanfiracJve PCR methods. 



Quantitative nucleic acid seoueacc aif^iysis has 
had an important wife in ninny fields of biologi- 
cal research. M eas women i of geut? expression 
(UNA) has brsen used extensively hi twniliorJng 
blototflrvii mspumes in vanous xilmuh OWi ;: "ei nl. 

Huaiiji ct aL ld95u,l>; Vrud'bomcio el al, 
1 9*»5). QuaniHativr genr analysis (ON A) hns 
in-Oi\ vicxi to tM^vmine th< gunuuM* ^uaniUy o/^ 
particular gene, <i3 in the case of the human HEH2 ■■ 
geiie, which Is amplified in -30% uf breast- -tu- 
mors (Ma men d al. 1587). Gene and g,oiiovnr 
quantitation (,DNa and una) aho have been used 
for analysis of human immunodeficiency virus 
UiiV) buTciwi demonstrating changes in the kv- 
wis of vl nu throughout the <Hf faront phases of the 
elbcase (Connor r.r z). 1993; i'lutak ci al, jvvnii; 
rurtado ol at 

Many method* have heen described for thi: 
quantitative analysis ot nncltflc acid yet|uences 
(both for UNA ondDNA; Southern IVfSy Sharp ci 
ai. hho; Thomas isxn). Kocently, I'Ot "aa* 
proven tc u powerful tool fur quantitative 
nucleic acid analysis. rc;R and reverse trmtedp- 
TH^c (RT)-PCR have permitted the analysis of 
minimal srantiiR quaniuits of nudeJc artery' 
lillte as one c«H vquivalcnt). This hea niadr pos- 
sible many experiments (hat could not hnvc. heen 
pwrcj/oidd wllh tradilional mcthodi. A II hough 
?CK Im provided o powerful tool,, it is Unpcrarivc 



thai n hi* um:U piopdJy tor quantttu\Jan 0*"Mr* 
rnaeKcn 19V$), Mi«y wly reports of ^uantiu- 
ttvr VCK and RT-PCR described quantlt/itUjn of 
tiie product hut did n«t meacura th* Initial 
Varjvi sequence: quantity, ll U es.«ntia] to df.sign 
proper controb; for the qukntiUtion rtf the iriilial 
impyt 5eqocnccx (Hcrrc 19^2; CJcinentl ct t*l 

NvA<mi <hc« hrv<". d«vclope<l scvei^i ni«thod.c 
iirquahtitaUue i'CR and it*r-) J CR. One approach 
mt i «wire5 1 J CK product quantity in th* log pivas** 
r»f ift'c'ruacnpn hefojt; the p)ate«u (Kcllagg *t al. 
1*>P0; Vang ut «K ItHiO). This method roouircs 
That each sample tins equal input anumirti of 
nuck'u* «pd and that each sarrpla under analysis 
amplifies nUJi KleHlit J cffleiency up to (he point 
nf quauiiMivfl analy^Js. A gene icquonco (rnn- 
tallied 111 all sample j ot Ttlativcly constant qu an- 
litVi, such ax p-fl<?Un) can bo ut«d for sample. 
uo/ijlUfrtaUott ft/lieicncy ft«rmali2ati<»n. Uslny 
ecuWenxicinal methods ot vein detection and 
qunntuntuTi electrophoresis or plat« opturc 
hyhrfdlzarifin). It 1$ ejrtrfemcly lal>arlous to a.vure 
that All smnples urc analyscetl during ttm log phase 
of the rcarrlon (furlN)ih tbcUi^cl cenc and the 
hon^iftlization gant-)* AncUwfr ndhod, qtwntiU- 
tlve competitive {QC)-PC;H, ha;»hceti developed 
and i*tittc! witiety for PCR quantnation. QC-PCU 
n;iic* 07» I he inclusion of an internal control 
coinpcihor l» c^ch reaction (Hnrta-Andrc 1991; 
HaTbk el al \99'J;h). The ^Qelcncy of ««ch ro- 
aclion is norniiiliwd to Ok Inlcrnpl compel hor, 

A Wn<vwn »irifmiH at internal COinpalitOT CAT 4 bo 
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addod in ouch flAinp)u. Ya obUTivreUlivp nUAnu 
tnUou, Ilia unknown larRct PCjVpfOciuei is cum- 
pared with the known OTnpeiilo^U'^R-'pift'jdiuCT. 
Aucceaj of a quamllnllvc competitive l*CK asttiY 

relies on developing an internal cununl nun am- 
jJiru^ wiiii the *anie efficiency nethv «*»ol. 

cculc. TJlC dcMKr* of the coriiv^hvi nn J lfav v*0l- 

oauon of amplification cfffiJ unite* jcqiifrc a 
dedicated eCfu/t. Hvwvg/, because QCUl*c:« doe-v 
lunxvquirc ili^i PcrHjnuducts be «>n*ly>saii during 
ch€ Jo<i ptJutW oflho *t ujilific&llori, iff* 0<n WtfUf 
of i he iwta iccthoda to use. 

Sevenil duructluii «y»Wixu we u>vd To* quan 
tlt*Uive 1 J CH a»u1 R'l^KlU ajittlysidi (1) <iguft»ab 
sub, (2)AuurcMxtii taijeihixof i*c;tv ;irop«cw find 

detection wuK li»;u:**fi\duoruI nuoresceiiee vixlii^ 
capillary ctlutttftqdion-Jiin (K«i«ccf *i fth 1995; WIU 
lloms ei .il. 1?9o) or aerylmulde xvl», mid CV* plate 
caphirft and sandwich probe hybrid !/ut |tm (MmI- 
dor <rf ah Alihougti these im'thodf* j«-ny«d 

successful, earn menuici requires pusi-VCR ma- 
nipulations That .tdd-Timu Ui thu aualysh ;uid 
<ii<iy lead Ui hibuialory < mHrtii'iiaal ion. "I'hc 
sample ihtuughput u( lbv*e ji*rlhi«J> '»> limited 
(will i ou- rKtx:r>«ir/n of thy plate capture ap- 
proach), and, llu^/Varc. thcat- method* ere not 
w«J] juHcO fm u*<*> dcnmndiu^ hi^h sample 
Throu^jjju; (I.e., jereenliii; of laxfcC numbers of 

Uc* or cJink'ftl IrlaUi), 

I lax wx" lxqiorr riic dcvciupnu'iu of u novoi 
iixsay r*ir quantitative DNA atudyiriA, Thv «vnay ie 
l»fisr<l on Die use or the ,V nueletue ax.viy first 
dcsaibed by Hoiiund et al. (1093 J. TJfv method 
»iA*tji Oifr S' nuolcAK At:iWUy of 7V#*f pi/lyin cva.it: icj 
dcavc a noncxicndlhlr hyhjidi'/niion prohc dur- 

It)^, the t*x*«ua»!oM J»itns< <.»f !'CI<- Th« njij in if^.lt 
\uca clunMabclcO n\JOrot<TtlC hyhi1clJy.ntlon 
pl'obcs H-c^cl aJ- Jl?P3; J«u«clui- at--.il- iJvoh 

rcp<iricr |fA\f 0-c, ci-cafboxyfiunrciccUOl ^»nd ms 
emission jpcclr* is Quenched by ihcftrccnd flue- 
rwcfint dyr. ( TAMRA (Ix., <J-carlwxy-lclr,nitlciliyl- 

rliodamlnc). T)ic nuclease degradation of the hy- 

I'AM fhu>rv«o:£il cntixil<,ui, A'MiKir<x in an ln- 
<rc«3u In peak fluorescent ^mlailon nt 3 3(3 nin, 
The USc or a sequence delect ar ^AUi l'ri«n) ,iJIt»w« 
fnciiurcinani of fluowcuitf Hpyetrti «i «J1 ^6 v«\-]b 
uf ti^. mcrntai cycler conamumMy aunjin mc 

IX"«R Amplifies thim. Thcrt'.lWc, ihc rvuv Uuu» hiv 
Htoiilliirvd ui real liiue. T)lC ouimn dftlfl la dc- 

scrioea and quantiniiiYc mmiysib at iujiui lai^ci 
UNA fic^uc-\ccJ is di?c\i>3dd beluw. 



RESULTS 

fCR Froduct Dorcalon In R«<ii Time 

Tlic iwul w*ik to devdypa hSglvHhrtm<rbp«t, >cn- 
kHIvv, piwl m:t-*iratt: gcjic qiiyinhJilUm Jiv«jy for 
v?c In monhffirlne lipid mudlal<«J thwapoulic 
gene dcltv^Ty, A plusmld \mcoding human fntlor 
vm g«io «.«<|u<»rte<j, pl/STW (fiWi Ndeihuds). wa?! 

uAcd n> a incx1«1 tlurfftpculie no»w»- TJw ftssh)* vipra 
fluirf^oftt Taqiwan mothotlalo^y mid m Ins I ru- 
men I ta«rinb1c of nicfift\irinK nuoroftCtfiice i>i *c»l 
Ume <AD4 Prism 7700 Sequence JVicnox). Hie 
-t*uqiM«t» rtuieihwi ruqufrii » Jiybildtotlojj iitoIv 
lulled with two different fluunuwent dy«s. One 
dyu \h a rojxirlvr dy« (VA.*4), the other U X quenrh- 
iiiH Jyv (MMttA). Wheii Uic jmJw 1* (loo- 
icaecjil energy haiMfc/ 1 occur? and the reporter 
dye ttuore*-«iit emission is obsOfhcd ny the 
quenching dye (TAMRA)- During tlic ontonulon 
phaw! of the 1'CK cyde. ibfc Ouorcaccui Ijybrid- 
Us lion fifiihc U cleaved by tli* S'-.T nucltolyiie 
octivhy or thr ON A pofymcmsc. On cleavage of 

Uiv probc / the rtf^wrtai' dye cmisiiou U jio lnoflnr 
irana'crrcd cmcicnf.ly to the qticnchlot; dye. to 

vuhil'^ In cm iu<jro3*ct of thw ruporlur Jyu fluarti- 
«C*'t nuixiloii H]Wdfn, I'OH yr;««w. |»rub«n 
vrc/x* dtstguud (<n ilie Jjunnart r^cioi VJ1J se- 
quence and hum 07^ p-^ctl" jjunv 4lt-ji«:r;i>*d 3n 
Mcthuds). Optimization xcneUwi? were pe.r- 
rA/ni6d Uj eftooae II; e upproprlutc prat»o and 
mfl&Ycshun concenuetions yield bi^ llu* lii«lu*^ 
imwwiiy of leporvcr nuorcsecm ai^nsl withovit 
wnc/Jflon^ ^vJfidly- Thc» IflJMrumiinl uh<:h a 
el ia ik u< coupled device (I.e.. CCD L-miuT(i) fvr 
mc^uflng the fluorescein e>nl*Kioa qwlru (tutu 
sou r.$0 nni. Hacli WAX lulx: w«3 mojUtored 
M-qucntifOIy ftir 25 ri\*ci_- with ci*rtlinuouii Jnunl- 
torinA th^>uplM*i"1 • |» J if i«j».t ion- Ufcch Iwbv 

y*a* rc.-cxantlncd every (1.5 aee. Computer ^>/<- 
Whm wu< dt^i^ncd t« iixainljiti Ui* /Kio/cscc/i1 In* 
ten?My cif both the r^]*orre.r \lytr (I-AM).nnd 
\lic quenching dye (TAMIlA). < >»c iiuore^ccnt 
intensity of tlut qu^ncblnfi dyu/rAWl^ cluing 
very Unle nver the course of tlic TCR amn 1 lo- 
cation (dalti not Jiiovrii)* Thfsfftforfl, the Intensity 
of TAVfUA dyv* wntsyioi: serves <(N ah hncrniil 

NlaitJtLfd with which u> »»om milieu iho r<.-|*ortor 
vly»: (FAW) cmlaalon vnruit-Jo^a- 11 \t ftofiware enl- 
cululca .i v<iiu<: termed AHn (or ARO) uainfi the 
foil owing equation- Aftn - (W) (n**"), w!,oi*o 
H" J <-eni)!nloii hituj^iy <W ix.i»onef/eiJib>H«i in- 
tensity t>» cpseitehef nt <uiy given UnK In rene 

tloi] tubv, and - vniiftaSor* i«+entithy of 7**- 
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poncr/cmisslou liucmliy u/ qucnclurr mc«Mircd 
prior lo PCK iiiuplUicaliou in mar came rwH»6i% 
mix:. J'ur thp purpose 0/ cjuauiHatWui, Ujv 1»m 
tiirte d;ita poinii (AUtu) rnnecicti during thttex* 
tension jttp for <rnch J J CK cycle %*rrc anafyy^d. 
nio micieoiyiicoegraoanoii onnc iiyunoixuKan 
probe ocnitt during the cxtetifcUM phase or rial, 
and. therefore, reporter fluorescein 1:111 wnjun 10* 
creases during this time, ri«: iiu«u daw pivititd 
were flvcmjttl for cad* K^-K eyvlc and the hic*m 
vamc for cadi w<*u plotted in an "arupimration 
\t)ai" shown hi I'ijjurc 1 A. The AKn mean vhUk- 
pU'uiccl on Wit? j/«jlx)s, a no nine, rcprcscmed by 
eyen- number, iv plotlcui <iji Uic x-6xle« During ttm 
early cycles of (he T'CU amplification, tl»r ARo 



value remain* at base Hno when xiirfjci^m hy- 
brUJi/allon probe hu& boon cloavc-U Ijy Uiv Xinj 
ixrtyfncrflsc jiuclciflw* activity, the lnuaiiiiy 0/ f<*- 
porier fly<ww»ecjii omfutnti lnyrv«bx%. lA(*t.\ PCR 
unii/llfh^Haiix nrucJi a vrialcou phdni' r>f ropnjtvr 
fiuuiCMViii omission if rue rcuuiiuii UttrrUtf im\ 
tohlph cycle ijuhiUvi*. TJhc omjVKfirMlon plot ]> 
cxuiiilnul vaiiy in Urn reaction, nr 3 jjuIjU thai 

• ,v '"'- : icjwc^cnU Hie lofl phiNC <»r pioducl arn linul 8- 
liun. This Is tlouu by wSAififtltu* srbiUurv 
iVoshaM tu«i <k bwed nn th* vuriuljilUy erf" ihe 
hHurc-iiin.- duiA. In FIfiVTK 1 A, tJic ihrefthoM wA* »cl 
»i la standi! m UuviHdoiiji above (he rnrmi of 

Once the threshold is chosen, the point at which 
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wgure \ PCft procJoct detection in real I'mc. (a; Ttv. Mooei rroD wjfi^are wu» construct amplifiwOon ploti 
rroin The extoncion phai« fiuorosceni emission data coUeaWl during the I'CR amplif1C«Lian. The standard de- 
viation is determined 1r«^ the data points collected from the base tin* of the ampfiflwUcn plot valufet anj 
calculated by determining the poinl 01 which the fluorescence exceed* a threshold HmA (usually 10 times the 
standard devlailon of \hc baw line). <R) Overlay err ampllflcailon ptau of serially (V-Z) diluted human oenum lc 
OWA sanies anipilfiftd w(th B-octm prirnGit. (Q Input DNA enncerrtration of Uvj samples ploacd vartus C T . All 
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ihc amplification plot crosseo iUetltfc:£hoid'HU(? 
fined as C r i* rcporl«*i a* Iho cyclo numbw :H 
i hi* poim. As win be dqmonstrutud, tin* c: ( ^uiiuo 

1st {iieOiU'vt iir the quanii ty of Input tnrgut, 

Cj. Valvm Fravtdir a Quanat«dv€"Hea«u»*emcnt«of 
Input Tarter Sequence 

Plguixs IB xhovs amplificHlioji ploU of ]ii«Alffc»' 
e«l TOR curtplificolionj overlaid. *1V amplify 
rtcmt wore performed on a 1:2 serial dthitfow ^ 
hvrnan ffcnoniic PNA, The amplified tarfid w;u 
hummi p «e.tm. Tfto amplification plofu nhifl to 
th<* right (to fclghet threshed cycles), jiji the Input 
targel fjnanfiiy it red need. ThJ* if Avpcctod b«. 
i'jnwo muitflttfu twith fHwoi- iitRrtin 4 q cop hut of the 
target molecule require greater aiiipllflcilon to 
degrade enough probe to twain the threshold 
fiuome*nc«. An aibitmy threshold of 10 'stan- 
dard deviations obove Ihr hn^r line wag tisyd to 
determine the C r valuer ^ifcurv HircpiofcwitHthv 
C r values plotted vcr*u* (he 'iiiiiipic dilution 
voiuc fcnrh dimvton was amplified in triplicate 
iv;* flmp If fieri turns and plot t yd as uu'wn value-* 
v#Kh crtcc bnn representing one standard devia- 
tion. The Or values OCxreasc* linearly wjirj Increas- 
ing target quantity. Thus* <; r valuta can be used 
« <pumTirfliWc mcasumucm of tin* input target 
number. It should be noied Ujat the amplifica- 
tion plot lor (ho ]5«6-n& sample *hown In figure 
IB docs not reflect the same fluorescein raty of 
increase exnibuad ny mojt of (he other Mmple/s. 
The IS, 6-ji£ sample ahu athiv.vw cndixXnt pia. 
tc*m ot » Ittw^r fltiorescent vajue than would he 
expected baxutl on thi* Input PNA, This phenom- 
enon h«w been observed uccastonaliy with <in«er 
*«rnp1c.i (djia noi shown) and may be Attribute 
able to lata cycle inhibidoJi; tbi.i hypothesis --it* 
*Ull undrr investigation. It Is important to note 
Hod tiu» flattened siopo and early plateau do dot 
impact Aignifie.fi i itly *"hc elicit iaii*cl C n vjltiy at 
dcuwwrntet] l>y the fll on I)k- line gtmwrrjn 
Fisufc. 1 C. All triplicate umpllficniliMix n^uft<«tl in 
*vry sbniloi Ci- values — the .Utindnnl devnatinn 
Otd pot 0,5 for xiny cflJutloji. TJib eApcri- 

iTiimtcontKiniJ « >1OD,000-foW rangr of- Input t^r- 

gt:i mdlrruics. thing C, values Toj' quantlUtlon 
pcnnUs a much larger osflay range than dJ/ccUy 
uainsr toml fluorcscatt. emission inlcnsity for 
(juaniliahnn. TJw linear range, oi Jluoresccnt in- 
tttuity mcaMiremcnt of trie A1W riiiiu 7700 Se- 



Pt Al 1IMI OlMNMAUVI K'-K 
mi'itffi over n very fctiQe rjnigo /i* rolallvf* «ar<inp ( 

t^miilo Preparation VjliHatioii 

SL'vera* paraiTU!t«n UifluciK'c tm« ef'Ur^^y nf 
PCW. lunpUfk-fltion: ma^ncj»lum and Jtult eoaeviv 
aatiom. rcAaton ccsndtuunv (J ( ^. time unci mm- 

p«rtitui«) ( PCH lAi'fiel ^iflc *od compaiitloJl, 

primer aequcncci, und .xflrnptr punry. nil <>f Tlu: 
,atK> v ^ lactWJ ar« Winim»i to a iiiiplc I'CU many, 
cxcrjjt smnple to sample purity, in an effort to 
validate the m^rhod of iuimpt« preparation fo; 
thciactor *\Ktiy f J'CKunjplilkoTitm irprodur- 
thlilry and niflclency m io lepjicaic snmpJc 
pr«*f Miration* w«n^ t!KaiiiSn*d..Afi«:r K<rrtf/mlr DNA 
wjw i?rcpajxd fruiji tjitj 10 replleaie $a7)ipJL'S, mc 
UNA was qutinUtaicd by idl/avjolct speeiroseopy. 
AmpUficaUoni vrcrc performed anaiyzlnr: p-nnlo 
xau: content fn 100 ajjd 25 of total Kcncunic 
UNA, iiach K:K nmplifiCftUon wus pcrfotn\ed in 
•Ulplfeau\ ^'oinpirliioii nf C r values fw cuch tri]?- 
jieat*; isuiiple %how t/iinimul vtsHutJoii ha.-ifd nr» 
standard deviation and roefrtdeni or vnriiince 
(i anic i). Therefore, each o! the ifJplic\t« PCU 
Hinpii«ctitiorLS wu$ highly fcpforiuclbie. demon- 
airatl^n thai reiit nrne 1 J CU using thk irui/v/non- 
ititlon introduce* ntinlmftl vsriailnn Into tbr. 
cjaamiiacive J'CJt wwlysls. Comparison of t»tc 
mean valuer oi \\\« JOfeplicoH* sample ptepti- 
"tiona nl«j showed minimal variability, indicat- 
ing; that each sampJe pntpawticn yi«UW0 sijnil«r 
/e^LiJis for (4-actIn gene quflDo'ty. J he hi^h(vi C r 
difference het-weni any nf rbc sampler was 0.S5 
;tnd 0.7: j or m« joo end 35 wmpit% res*p«t> 
lW<dy. Additional ly. the amplification xif vorJi 
jiflmplf cxhiljln:d «r» ctpdv^lcnl rale of fluo/tt* 
cem en>iKsion intcnMty ehang-e pej- amouni of 
DNA target onfliyzcd fi5 Indicoicd by \itnilar 
sfojw dextved from l he lamplc d<U4t;<\ny (i-n^. 2), 
Anynmpte eontaiain^ an execaaof a I'CKinhibr- 
lor would frxjMhb a ^waier measured p-acdn <: r 
v^Iue for a given quantity of UNA. In addition, 
toe inhibiioi wouiel be diluted along wun tlw. 
.\Ainplc m chc dllMnon anaiyNis (H^. 2). aiicrla^; 
Uke expected vilue ehaj\^. Kach ^amj/c iim- 
pjjfjcaiiou yielded n :indlflr rcmh In me analysis, 
dcmofKtroilii^r tbat this method of sampJc prcp^ . 
ration la hiflhly re[rt'<iducihle vflh regard to 
pic pvtrliy, 

Ouantitaffve Anafvsisof a Pi^inid After 
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Tat>U 1. Reproducibility oflnmplo Preparation Mathod 



Sample 
no. 



9 
10 



100 ng 



tft«nd&rd 
deviation 



CV 



18.23 

ia.33 

1 8.35 

ia.3 

18.3 

1S.42 

18.1S 

ia.23 

1B.12 
1H.38 
18.54 
1P 

1B.2B 
18.36 

1S.4A 
1K.7 

i*.?» 

IS.;* 
1«.36 
18.42 
18.57 

7 8.fi6 

0 10) 



1U.27 
18.17 
15.34 



IS. 39 
18,63 
lft.29 

18,<t2 



0.06 
0.06 
0.07 



le.23 o.oe 



TC.42 0.O4 



16.74 0.2*1 



0.12 
0.1$ 
0.1 

0.12 
0.17 



0,37 

0.35 
0-46 
0.23 
1.2d 
0^£ 
0.H3 
0^ 

0,6£ 

0.90 



20,49 

20.55 

20,5 

20.61 

20.59 

70,«1 . 

20.54 

20,49 

20.4* 

20.44 

20.3 e 

26,68 

20.87 

20.C3 

21 .OP 

21,04 

21.04 

20.67 

20,73 

20.SS 

20,98 

20.A4 

20.75 

20.46 

20,54 

20,4fi 

20.79 

20.78 



25 ng 



standard 
mofln deviation 



20.51 0.03 



0,11 

20.54 0.06 

20.4 2 COS 

?n.7"l 0.13 

21.06 0,03 

20.46 0L0<4 

20.C6 0.12 

20..S1 0.07 

20.73 0.1 

20.6^ 0.1 P 



CV 
0.17 

0.23 

0.26 
0.61 

n.v« 

0.2 

0,57 

0.32 

0.46 
0.9-1 



tor examining a pun ml cUNA for htixu<m fuotur 
vin, pj:«TU. a acrlv* unnsfcciiom was set 

up using, il dcCfCJJlAg amount of ih«r pja«nfdl(40, 
4. 0..*;, and o.l ^.j. l\vcni>*rcjur hours pciM- 
tronjfcctJnn. tomi hNA wax purified tVcim cacti 
/(ash uf {.tils. p-Avliii ^u^jiUty wa* tliwaci i « 
* vti) wr f<,r norniafinaiiHii jicuomic ON A enn- 
rainaiKin ftoiuwli arcunplv. hi ihls trxpeijiimni, 
p-aenn h«<ic: content should remain contain 
rcloilvc tn toTal j£UlUIJJlC UNA. HguTv3 shows lUv 
rWJlt of ihe p-actui UNA mcaMntrmctu (100 
ioiaj D'NA determined by uifTavfolet spectra- 
copy) o( wdt t'mi*i<l«. Kach Aatople ^n* analyzed 
in triplicate: and ine mean puaeun C r values of 
the Uiplicates were plotted ftrror bai3 represent 
^M-..i*rn navunnm 1 MP tUflivHT ilifTrirrnrr 



l»vrw<-uMi j««y s*mjj!(t moan* was 0.1'S *J*n 
nanograms ofiutal UNA of «Jch *a»»pi«i wqjhj alx» 
exo mined Am p*«clln. TllC J Old to 0&MJn showed 

Ulftt very similar amounts <u genomic DNA were 
prcfvnlj'fjic maximum mcun p actio C t v*] u c 
dtffcrnicc wii 1.0. A* J'i^urc 3 *1\owa, the rale o/ 
p-at-tui c;,. LihuriKM U.*Iw*<mi tho )00 and lO^i^ 

Stunf )!«;;» was s/mllttX (aIo^C V<»Jacx ranytt l>CftwqcrA 

^.5rt and -3.45). Thi.t vcrJ^.^s again tVml the 
method of MamplQ prO|J<iratton yicltb ttuTf*y1<*f\ of 
Identical PCH intc^riry (J,^. no taaiplfcr contained 
in OXCOJfclUl* amount uf ^ Inhibitor), I Tow- 
ever, lis car uxuhs Judical a that ewcli lamplccaii 
UlnCd slight diffoienw i« th< >d\*Al amount of 
gwuijnU: HNA aufl)y>xd. iMvnn I nation uf actual 
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fid 1.0 i J* i.d 
log (ng Inpul BODomlc DMA) 

Ffcyun* 2 S<'iipl<i preparation purity- 1 he rr plica hi 
aa«npiM shown fn Table 1 woro also ampifflitd In 
triptcaia using 23 iiq of ©acf* DMA Jflmple, The % v 
uie shorn the inpul ANA cOnccnlrMian (100 «r»d 
25 ngrt v;. C, *a fhf- fTrjuro, the* IOC nnd PVflg 
point* lor oach (ample are connected by a -fine. 



by plolllng th* menn f*-w*:tfii C u value obtained 
fvr raid i l()t)-lig &tuup]u uh j p-ocltii mndxru 
cuiv* (shown in I'lft. 4-0), llio «aUml £CfiuiuUr 
ONA couccutrathm tif each spin^U:, <r„ was oh 
UlUcJ by extrapolation U\ the i<ak;&, 

l 7 i£ur« 4 A showy Uwr jiic^flurccf (t.ii- # imkV 
norm Oil wed) H"utiljlJci of fax tar VJ1I pin unlit 
(prerM) tram each' uf the four transient ccJJ 
lrH"?>r*e<ion*. E«tfb rcuelSon contained J 00 n£ of 
Loial sample 13NA (fti detent Hiuvi by UV «p<Mc<>* 
copy). ll*ch Kftmpk* hji* unul/fccU in iriplU^U' 



/c24J3--3>». ft-* 




m i-e lb 
,<s 9 top** OMA ) 
figure 5 AfialyiUof Udnifcctcd ceil DNA qv^nUlv 
ana* pumv. flic DNA preparation* t*f (hu four 293 
C«JI transactions (40, 4, 0$, and 0,1 ng of pF«TW) 
were anaty^Rd for the 5-axtln gene 100 <snd 1 0 ng 
(determined by uicravioiei spectrtucop/) or each 
sample were amplified in triplicate, For <och 
omount of pf 6TM Hiai was tr&mfcctcd, the p-ncdn 
C T values arp plnUf^* ver*u< rhe total Inpul DNA 

.*>/-^r»^i [Art 



hi \\ TIM I* ouANTfUiivi; pu? 

PC.r* u/npiifi cation*. A« ahoum, pl ! 8TM p^nfit- J 
jfwrtr.flic 293 cells tlticrcustS (mean C, v;duei: In. 
crvui*£j with duCrca*ing amuuuU 4if pla*mld 
vtruiolW lcd. The in can O k values obtained for 
pHVTM 4» Tlgtif £ 4A were platted un a il^nd'jril 
«_urve 6<-mprla«d uf HtHtilly diluted pFHTM, 
shown .In rjpuw The tjuantlty uJ p*x i M, <j, 
fouad in *(xh of the four trflnsfottloiifi v>n.i dc- 
tcTminecl by cyrrapra/auon to the x zxh uf iho 
ftlondard «ur«« 1%uto 4J<« 'HMirtci uncorrected 
valucit, for pI'HTM v;«tv .3iiiriii)i)f/«rl trt dticr- 

iiiirtc/ \3l<* actual imouol of pV§'i*M f^irul p«r TOO 
riK <if tcnojnic DNA l»y ujiImr l)ie cqoailon:, 

ff^V^JJS UCHUI pwriwi Wipife* riej- 

J - - j(K)ng of gnomic UNA 

wlidr<i « *- actui) (ja/tonic TJNA to y win pic an<i 
pr«TM copicj /ram the stdud&ftJ CUTVt:. 'Hie 
iiarin^JJr-cd ^uuriiity «»f pI'ft'i'M pC* 100 hr of 
aomlc ONA foe c«ch of ih« ri,wr f rRfiJufcciloiw 15 
tSnown HI Kl^UfC ^Ji. "Hieac roulls aIwah Ui*t »hc 
qunnuiy of facior viii ptasmiU iijwivJdit^U wiiii 
the 2.^0 ttlls, 2t lir after iou^A^iujk, Or.t.ie^cs 
wi«i uecreasi-mi ijjhmmio ujuuinuaficja ux«i Ju 
the U3vi»*f«c*H>"- Tin: cjuiuitlty of pJ-'H'J'M iiiacvJ- 
utod vtth Z9$ cciia, tifivr uunnfcctloa wjih ^0 u-y 
Of pi^K/iiKl, vas 33 pg per )00 ^(tnoiiHc PNA, 
Tliis fcjuin in * S20 pituniiJ copio pcrXvlJ. 



DfSCUSSJON 

We liavo dcstxJbctl a new itidhcU /ox qu.mtnm- 
iii^ gene vopy n«mb«« /«aMlmc wnutyM? 

of i'CK ampii/icatlonx. Hea!*tlrnc POK mm pat- 
talc with cither of tru? two K'J< (irr-PCR) ap- 

p<VMcliv,>< CiU»4llU«ltj>X' VPOM"=l^' v V w^crc All 

unuiml Lumpcilwr Jtor weft Target Ncijucfioc b 
used for Goorm^ltxaUon (data not dhown) or (2) 
quomlwutYc: coniparadvc uaJng *» iiuuurtlKca- 
cion gt'iie coniaincd within ihcsajnptc (i.e.,, p-ac- 
tinj o/ a "Ji0iiit;keepiJl£" %<:nt far RT-J J CK. U 
«qual aaiounU of nucleic uad exc atwly/al f<ir 
each taxiiplo aiKl if the ampUfWaitfcjn vf/f\irncy 
before qunntUntlvc onrtlyali o *dcnucal for cndi 
tampic, \ne m rental corn mi (iu«iiiii]^iii(»i ^wit 
ur compctit(jr) fllsouid give ix|na«f ><fj*i^l& for «li 
9HmpIcs. 

Tho fcaMim* i'CK mothcxl offc/s s^vral acu 
vnnteiRW over the other two method* t^ir/cnUy 
employed (see the ImrudurOcm). Mrfii, \hv real- 
Timc PCK ^icthud is rK-.rforoicd in i* dosed -tube 
sysuun i»ud requires no |ww\-Pt:U maulpuiauon 
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FJgur« 4 OjimmltMluo analytic erf pFBTM in iranS*C<U*d tcHi. \A} A^un\ ©{ 
ptamfd ONA used for (ho imnsfccifon jrtoncd ag.1ii'i&t Uiu i«*u»m C, value dettf- 
minad fnr pf8TM rcmolnlfiq £-f hr iJficr Ittfirfoctfori. (ff,C) Siand&tU curvrj of 
pCAYM *n<J Or., respectively. pr^TM PNA (J) *<id genomic l>N/\ (Q were 
diluted ally 1;$ be/oro amplification wilhti-ia appropriate primers. Thcp-aclin 
9tA/i<f<i<xl curve wav utod to nAm*al«><f ihr feavlU of A to 1 00 ryj Qt genomic DWA. 
(O) The amount of pfSTM prc*cru p** 100 ng cif genomic OMA. 



lit* ]U\\<Jt>\i*i) fof PC*H COI1- 

fjimlnaifou in the luborjiiory reduced ucCHUftC 
Anip!Ulo<i pj'tfducu urn tn: aoalyriKl and <Jiipanc<i 
of vfiiljuui opening tiu« faction tube?. Sccvndi 
(hii niatlw»4 *«]>j>oiLT 0k- u.w of 4 luinnuK^tliiu 
Rono (i.e.. 0-«ciiu) /or quar\ti«*1iv* PCH orlvouae- 
kecpu-'g scn«* far qunnlSlntlvc RWCU com mis. 
Analyaii h performed In real time (UirOjig Ihc Jo$ 
phase of product accumulation. Analysis ck/rinfc 
phwc jjcnnit.i many dl/A-rcnt £cjic* (ever * 
trWc input target rang,r) to l>c Moly^d ,ilmul)<i- 
ncuudy. without concern of reaching reeuifcin 
pluiacra «t different cyelo. TUi* will wake «iuHJ- 
£cru- andlyih assay? much caMo lo develop, be- 
en u»c WivJtJtial intern i.ujii[jrU(Ui> wfll nut be 
m-cOcd for cocb gene uj,*J«r wiojyoli. Third, 
>d til pic lli'cwghpul will luv.icd>c OraiMttU^Uy 
wiTh the new method hoc&uxe Uici v i* no ij<j«u 
i'CU proceeding lime. Additionally, wen king in a 
:J0-WCll format is highly compatible with a\lUy 

The rvdl-Luiiij PCtt uicthcxf is ltl^lily icpr<>- 
Uucibk. AcpMcaic amplifications can be aiiitlyiccd 



for each sample oUiilmUdng jxHurlUai error. Tim 
Kyvunn i«lIo<«^ iVt a very Ia/fit: 4i\Say Uyn&tuta 
ro»Rv fu|i]Hoae.1iiii{: 1,000,000-foid Martin^ Ui- 
ftut). UalJig u ftUtulard curve £b r ^,h*i target of Iri- 
te/etrt, rvlutlvc coi?y nunil?cr value* can be dcief- 
nilncd fcii- any uuUw^i >3t«ipU\ W\uir«ccm 
thrcriKild yqiucd, C rv <onrJair Ihit-jirly with ffila, 

tjvc {>NA copy number*, Hmc <iujuin^iivc 
RIM'CJI tuclhadolfijy (Oibw/i vi b1„ UHS \KSUt-) 
h^.n olio been dc.vi^lopcd. fariiiUy,. ic^J llo»c ^u^/i. 
tUntJvr J*Ol mot htKioIwgy c«rt l>c Unlevdup 
hig[h-throughptK jcicciilit^ &M$fS for n >»ricty of 
applicqiJom fquaiitJtntlVC gnm« un^wucu (It'i^- 
rCR), ^an« <oj>y «»«aya (flcrfi/ f(c.)j jgC/lO 

typing (kut3ckom mouse, tnulyii*)^ and Jminuuu- 
rrjij. 

Hwil-iiiMc VCli nny abo lie performrd tiji^g 
JnrcroilirOnfi cfyo.si (M<gu^>i cl *L "\WX) tmch us 
oUxidium bromide. The fluorogeruc prc'ic 
mcOiocI offers a major :idvantA$»Q over inicr- 
oaUunfi dy«9-"^rcatcx »vcc]flcffy (I.e., pnmti* 
dlmcrs iiud no«sp<:rJftc PCA rirodudQ ac* not 
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METHODS 

Generation of a N«roM Gmialnlnr a Horrtol 
cDNA for Human Factor Ylll 

Tol^l UNA ]^fvral«l t)\H^A i* ffCTm 'IvriVetr l»c„ 
WU^dAVOOd, TH) fiviu cvJb I— tallied ~'th * lartur V1U 

^jik^Wii 'c<io», pCUst-fecSUMi «Ka«M irt *U dor- 
man < t at ium a factor v|j| partial diNA wwmv was 
^■ih^uo uy irr k* K^ucAnn* la run una IT* Vu 

{p*r1 WtWHWV/3. i*J£ AppUtnl lilflSyttCWS. <HJSU!t aty, cam 
u>*rif the HCH piuiiefJ JWor *»d iiuTuwr wqiidtrp* 
*f* shown b<k'w). 'On- ainpllcon was iMmplirircl tftfnc 
modified Mifru <ind Wrcv pnimn iwmUM vfnn fttuif u 
una #WlN rohtcUnn ,tirc jcqycncc* i»» ihv «»rif 
cioiuxl l»tl<. fKU^- 32 (rrtmu^n <J«(-p.. Mmiwou, Wt). Tie 
MflulUQjc<i«mr, i^IW, was iwrti tor iranwem iraaslcaktti 
of «3R 



Amplification of Target &NA ami Duiecibn of 
AmpticcM Factor VttI I'UfltitdDNA 

<nxi<u^A<*Au:itJrtiMi<«'*vr a«d i=mcv jt-aaakxt. 

<U<*CCiX?CATt:irrAOO-r^. Mlic rutvUujf iwvdmvd* 4*3* 
l^i ia*:w product. Tnc fnrwuftl jirhmf yr« dvalcuud ui fAV. 
nyiilxe « uulquv MMfiwuit" Ami*d J<i ific &' cirjironilfltccl 
Wj(kMi «f Uiu jnu«ui pQ5Z,i*.e$i> |i|»mWr mid riu-rcforr 
OwA i.ui wi^Uu imd amplify Uiw liuuitfti fcciuf Vni 
8<mv. I'fimai* »vcfo chutftm witU Hiy ««iilin wr nf lhe<*onv 
pulcr piogram OliflO (Hrt»ff»r«l hlusclcm'vS, 1m'. t*ly- 
mnutH, UN). Tb* htijtian p-»cUw gmc wn.i RmpUftcti vrlih 
Mwr pdiuem P-«vUm r*««wr*irJ jjrlmcf ,V M>CAi;(XJACA<nt:'f' 

cc^c/*jxrrA<-:CA-3' an J revvrje iyimci v.<:a<;. 
c;cGAACCX:^ .3 1 (:A < r^(;^:(UA*JX;<r-^^ Tiu* maion pi?»- 

6U£C<3 J JV5 hp vftl pffKhivl- 

Amplincailrtti fwciion* (SO mJ) cuitiflin^i a i^na 
lirmpi?, H»x WM ttwffvr jl h4 ^» M tlAlT, 0C;M', 

Tat] TJNa poiymwflK, 0»S Uf«r Ampnruc. AirATli N-ffl* 

pinuU* ttf i*u*}\i p sialic ptltti<*A. TUo i*a\*iU»«v tiVm tviiinmcU 
on* of (hv foM*wiit£ <*t«(vc<i«n (Km nu roHi>; 

XCTAMlUtCCCCrUTOCOATd^a' «licrr p Ijirtkit^ 
phospKofyUiion nnd X incUcMei & linker ami nude<itU>0 
£«a<Uon UMn? WKcniAfnp ftfHlOlt Tlilift (p^tt AUtfl- 

mwm aw*, tvwWn iuu«ji) \im wx« ffva«i i^yrfciu 
Rbu«i) to i~v~-.il M 6 M Irom j*cncd(iUi 'fVbc copi were 
±]niiU«m MIf«*A«ip <J«pj lnll jprcUtlly dc.lJflrtC*' lO pfC- 

iK*. /»eu«r VIL1 pcl»jcn« w«r jiymliMlWU ill C<fIW" 

ui(«B the Oliptj wftwarv, f«|knviil£ tuldoiluwi; SliU.- 

«cs«*i in tnc Modd 77«i .i^ucncc IK-wrt»ir ln,Kiiiiii«*nl 
niuoual, irricnv, pnAu- 'IV. iiiimUl lir fli JcqjI PL iKfiVf 
ihon thr wuivhUur \ru«A'j«iWift: u«0 durfim *iM*fmfcl cy- 
ffitiK; |;rima> should iiuiA'jh* M<i1jk- duplrtftsf wild Ihi- 
pioU-. 

TUu IUC/IUaI ( yc'»U^ cuiiditUim InrlutAud 3 nil" (vV 
5rC Slid 10 mm hi 9S*C. 'Ilu-.nt*il cy*Cig p wedrd «dih 



Rt At iiml piiANmAiwi k:u 

fwcltom wore perT<iniictf in Ihi. Mn/ln/ 7XH)Jirq«cna I3c 
lMH#r A£»p!«*J lOusyO^-uu). «*U»f h amauu 4 Co^ 
AjU|> im^u Nyslgm POOn. ma<(toa cuud:i;w«t w«.f». 
ftTttiWUVU J *Hrw« Mff&rUiiti VlOCt {Apple V#m\*tiOT t 

Santa uara, t:A> iinKed cU/v*ity w in* Model V7fia 

quonw lk.locu>f. Aji*jV u 4,r tt ^ w J M *^' w,n '" 1 

au. M^UtiA *om>*vW». r^UfiUm and *nmlycl*«ofiw«fc 
wu« <leuvJ«pfv! hi m A(njliv\1 nio<y*»u«ij». 

TrmsTeciion of Cc\U wlif* Taccor VIH G)i«ftruc( 

)*nurTl75 ftaskS or JWS ««« Virr.CUl. IS?:<J. ? human 

(tial wdi)oy fuspi'OAian c*ii ime. wva* grcwu 10 K(VK. ron- 
ihirnvy ami tmniKCffa f'l«*«*M. wctv R/«wt ( U> O.u 
MiKiwme mcdin'. swham^ »u wonnoui cut. w» u«ii 
fjucoie iHUlKWi'i inociifird FflKio incdiuii. (»M1*M) wHJi. 
fit n ijTydnc wlUi eodium biwfbittiaip, loaf, l»i^l brivinr 
iuruifi. a him i-HluUtpijK, «nH 1«» p«nicill1i i-*trtf»t«m y- 

*irm- pHlTM Wa ftfliouni« of ^0/ 4, <U, And OJ p-i; wviv 
; t diM t*t 1.f» ml of a winded oonulnlnfi 0,1 as >* 

*nd lx »lttiniS,Tho foul fv»*Jurr* v»»trfc-Wfl al fuo»n k«t«- 

pc/«»w it* 1U rtil«^ **«*l thvn m«J*Iv«1 d«-4*|«*-li«- lt» d*c v^Jly. 
■llic n«>k> ^vu.v.^l.fJ *l 2VC *n<* f % t'X1 2 for 24- hr. 

•« lulled iHis:, ^.ti -^»«*p<wi<iod In Yht? hwir 

jamhK cI *^rlli wr« dlwtdwd wil** «U<(w«<- -nd DWA >*iid t*- 

nnctr d iinmcdtmcly usij»« Uiv QJAjmm 41(1 (Q 1 ^". 
OHiUwrrtli. <^)- l>NAf^L> crUawd Intn 300 r t ^ SO «^ 
TrlvHOI ul pit 
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methods. Peptides AEKfCor AEQKwere dissolved in water, made isotonic with 
NaQ and diluted into RPM1 growth medium, T-cell-proliferatioti wsays were 
done *«cnt«ily as described 50,14 . Briefly, after anogen piOsing (30 fig mi" 1 
TTCfl with tetrapeptides (1-2 rag ml" 1 ). PBMCs or £BV-B cells were 
washed in PBS and fiiccd for 45 1 in 0,05% giuciraldchydc. Glycine was added 
to a final concentration of 0.1M and the eells were washed five times in R£M1 
1640 medium containing 1% PCS before co-culture with T-ccU clones in 
round-boKom 96-weIl microtitre plates. Aftct 48 h, the cultures were pulsed 
with 1 |iCi of J H-chymidine ^nd harvested for scintillation counting 16 h later. 
Predigesiion of native TTCF was done by incubating 200 p-g TTCF with 0.25 p.$ 
pig kidney legumain in 500 jil 50 mM citrate buffer; pH SJS, for 1 h at 37 *C 
Glycopoptlde digestions. The peptides HIDNEEDl. HIDN(N-$uco»minc) 
EEDI and H1DNESD1, which are based on the TTCF sequence, and 
QQQHXFGS^Pm)CSC^IFCLFR(K^CK). which U based on human crwuterrin, 
were obtained by custom synthesis. The three C-tcrminal lysine residues were 
added to the no emit sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTPCSGNFCLFR v-as prepared by tryptic (FromegaJ digestion 
of Smg reduced, carbcocy- methylated human transferrin followed by 
concanavalin A chromatography". Glycopcptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N-termuul sequencing. The lyophilixcd transfenin- 
denved peptides were rcdiiioWed in 50 mM sodium acetate, pH 5.5, iOmM 
dithioihreitol, 20% methanol. Digestions were performed for 3 h at 30*Cwith 
5-50 ml/ mi' 1 pig kidney legumain or B-ccll a£P. Products ^cr« analysed by 
HPLC or MALDt-TOF mass spectrometry using a matrix of lOmgmr' a- 
cyanocinruicdic acid in S0% acctoftitrilc/0.1% TFA and a PcrSeptivc Bio3yst«ns 
Elite 5TR mass spectrometer set to linear or reflector mode. Internal Jtandax,- 
di2ation was obtained with a matrix iofl of 563.13 mass units. u< \ \ 
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Fas ligand (Fasti is produced by acrivutcd T cells and natural 
killer cells a^d it Induces apoptosis (programmed cell death) in 
target ce^s'throigh the death receptor Fas/Apo)/CD95 (rcf. 1), 
One t important role of FasL and Fas is to mediate immune- 
■cytotbwe killing of cells that are potentially hannhil to the 
'organism, such as virus -infected or tumour cells 1 . Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasX arid inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcIU messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependem irnrnune-cy to toxic attack by expres- 
sing a decoy receptor that blocks FasL 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gent supcrfamily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from hunian fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide tfiat resembles members oi 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four ttndcm cysteinc-rich domains 
(CRDs). Like one other TTORhomologue. osteoprotegerin (OPG) 1 , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated. 
molecule. We expressed a recombinant, histidirie^tagged form of 
DcR3 in rruunmalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polpcryiamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 share-; 
sequence identity in particular with OPG (31%) and TNTFR.1 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DclU and OPG are conserved; 
however, the carbcocy- terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). W c detected a predonunant 1.2-kilobasc 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung. In addition, we observed relatively high DcR3 mRNA. 
expression in the human colon carcinoma cell line SW480, 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc- tagged DcR3 proieirL We tested binding of 
DcR3-Fc to human 293 cells transfecced with individual TN1 : - 
family iigands, which are expressed as rype 2 cransmcmbrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a signincant increase in binding to 
cells transfected with f z$V (Fig. 2a), but not to cells transfected with 
TNF, Apo2L/TRAlL* ,5 j Apo3L/TWEAK*' , t or OPGLmiANCE/ 
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KANKL 10 "* 1 (data not shown). DcR3-Fc immunoprccipitatcd shed 
FasL from FasL-transfccted 293 celb (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did die Fc-tagged ectodomain of Fas but not 
TNFR1. Gel-filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d), Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (K< = 0.8 ± 0.2 and 
U±0.1nM, respectively; Fig. 2e), and that DcR3-fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig- 2e, 
inset). Thus. DcR3 competes with Fsi for binding to FasL, 

To determine whether binding of DcR3 inhibits FasL activity, We 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a). DcR3- 
Fc and Fas-Fc blocked soluble-FasL-induccd apoptosis in a 
similar dose- dependent manner, with half-maximal inhibition at 
-0.1 u-gml - *. Time-course analysis showed thai the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (A1CD) of mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results 15 , activation 
of interleukin-2-stimulated CD4-posiuve T cells with anu-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granrymes''"-'*. Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Ec^ 
and Fas-Fc each reduced killing of target cells from -65% J to> 
~30%» with half-maximal inhibition at —J ngmT; die^rfetojCial*.. 
killing was probably mediated by the per£orin/gran*yme p>pw ; ay. 
Thus. DcR3 binding blocks FasL-dcpendcnt natu&l TcjUer cell 
activity. Higher DciU-Fc and Fas-Fc concemrations were required 
to block natural killer cell activity compared witi^those required to 
block soluble FasL activity, which is coi&stent^ith the greater 
potency of membrane-associated FasL compared with soluble 
FasL 17 . ^ V ,J ~ 

Given the role of immune^c^totaxic cells in elimination of 
tumour cells and the fact J3$lJ)cR3 can act as an inhibitor of 
FasL, we proposed that DtR^iexptession might contribute to the 
ability of some tum'dursHo - escape immui\e-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whetrTcx-the DcR3 gene is amplified in cancer. We 
analysed D ( qR3 gene-copy number by quantitative polymerase chain 
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Figure l Primary sirucigre and expreseioa of human OcP3. a. Alignment of the 
amino-acld sequences of Ocfl3 end of osicoprotegerin (OPG); the C-terminel 101 
residues of OPG are net shown. The putative sis^al cleavage sHe (arrow), the 
cysw<ne-rich domeins (CRD t -4), and me AMinked gtycosyfatibfi aha (asterisk! are 
shown, b. expression of DcR3 mRNA, Nonnem hybridization analysis was done 
y$lng thG 0cR3 cONA as a probe and blots of pcfytA)* RNA (Clontech) trom 
human feial and adult tls£uas cr cancer celt lines. P6L. perfpneral blooc 
lymphocyte. 
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Flgura 2 Imeraalon of DcR3 with FasL, a. 233 caJa were trans fectad- with pBK5 
vector (top) or with pRKfi encoding fuiMenflth F^sl. (bofiom), incubated with 
Ocfl3-Fc (solid line, shaded area). TNFfli-Fc (dotted line) or buffer control 
(dashed line) (the daahed and doned lints overtop), and analysed for binding t«y 
PACS. Statistical analysis showed a significant Olfforence V* < 0.001 J between 
binding of Oca3-Fc to cells ir3nefecied wiih FuaL or pR«. PE. phycoerythfin- 
labeiled cells, b, 293 cells were transfcaed as In a ana metabollcally labelled, and 
cell supamatsmc wore Irnmuncprec/ototed w«h H^aggeC TNFfll. DcR3 or F*e. 
e. Purified soluble FasL (sFasU was immuncpr©c<p'(tatad wlthTNFBl-Fc, 0cR3- 
Fc or Fae-Fc and visualized by immunoblot wHh ano-FasL antibody. aFasL wae 
loaded direcrfy for ccmpartson In the rigm-hand Une d. Fi3g-teese<J sFasL wa£ 
lncubal2d wilh 0cR3-Fc or with buffer and rcsolvec by gel filtration; column 
Tractions were anaiyaed in an assay that detects comp.exea containing Dcte- Fc 
and sF3sL-F/ag. a. Equilibrium binding oi DcRo-Fc or Fas-Fc to sFasL-FUg 
Insex. competition of DcF3-*=c with Fas-Fc lo' olndmg to sFjsL-Flac. 
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• tvr*\» in eenomic DMA from 35 primary lung and colon 
reaction (PCS) in genornit peripheral blood 

rumours, relative to P^^^^J ™f 18 lung tumours 
leukocytes (PBU) of 10 amplification. 

j o „r \7 colon tumours showed uciu gen= «"«i< 
and 9 of 17 colon t confinn ^ resuItj we 

of DcR3-based PCR primers ana ptuu». 

same aroplificarion (data no ^ h °^' ^ primary tumour 

We then analysed DcR3 mWIA "P^ 0 ", £*J DcR3 exprcs- 
tissue sections by in «~ hybnd«anon We .detected ^ «P r 

breast tumours, and 1 out of 1 gastric tumour v 

cwaan colon carcinoma cells, vrtucn snoweu 

D^S^flpliW more than ndghbouring genome + . 
fegSns that are not important for tumour sumval. Tb te>t this. 



we mapped the jta-j. ^J^^^^o 
DdUshowedlu^getoma ker^^toc U ^ ^ 

chromosome posmon 20qL 3. ^ «^ ^ ^ 

SS -pUfication of D*. ^ J&' 

SS&d reverse marker showed an average ^Bf^l 
^threefold, slighdy less than the ^'^^/^ 

>c ft f TVRV the other marker. showed, Jittlc or no 

TNFR family members, including Or<J • k„ w , W r 
FasLTimportint in regulating the immune response: however 
liKlTiTkn^n ibbui how FasL function is controlled. One mcchan 
little is known booul u« modulates apoptosis signal- 

^ ST^ « Srf£f Sntmedlsm invoU proteolytic 
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3 ,nhlb*on cf FS3L ac*vi V by D=*3. * Human Art. 
were incubata* wdh Flag-tagged FasL (sFasU Sngm. , olfi om«ted 

Phoaphate-buffared saline (PBS), human IgGT. Fas-Fc. or OcK > F 00* ^ 
ralaase of "Cr (mean z s.<J. for wo donors, each In tr,pl,caiel 
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comparing each marker wlin DcR3. 
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he Ction of the cytokine interkukin-l has been 
to addition, two decoy receptors that belong to the 
SS fiLy DcRl and DcR2, regulate the Fasl-related apopttiu- 
S^mol^le A P o2L«. Unlike DcRl and DcR2. which are 
m«Sc associated proteiw, DcR3 is directly secreted into the 
meiripraiw r .„ ..„ te H TNFR-farnUy member is 
extracellular space. One other secrctea iiy*. 
OPO» which shares greater sequence homology wtth DcR3 (3 1 W 
EdoDcRl (17%) or DcR2 (19%); OPG functions as .thud 
deSy for AP02L". Thus. DcR3 and OPG define a new subset of 
TWR-fcrni? members that function as secreted decoys to mod, 
*« ^uce apoptosis. Pox viruses produce soluble 
T^molo^es that ne'utralh* speciEc TN.-famdy Jmjj* 
thereby modulating the antiviral immune response. Our results 
SdSL Sat a simL mechanism, namely, production of a soluble 
receptor for FasL. may contribute to immune evasion by 
certain tumours. 



IeG, Mocked with 2% BSAU, PBS. DcR3-Pc or Fas-fc "« added. feuo-«d by 
serially diluted Flag-u 6 g«d soluble JteL Bound ligand was dieted w,th »»- 
S antibody as above. In .he competidon F*-* w„ ,mmobO,«d « 
a00 lt. and the wells -ere blocked with excess IgGl be.ore «Ui«» of Flsg- 
ueged soluble FasL plus DelU-Fc. • 
T-ee» A.CO. CD3* lymphocytes were isolated from per.pr.ersl blood of 
individual donors using and-CDi magnetic beads (Vultenyt Btotech> 

dilated with phyto^ 
iatheprcscr^oLterle^^ 
welUcoatedwi*and.CD3andbod y (Prarm^ 

16 hUier by FACS analysis of aiinexin.V-bind.ng of CD4 

NaW n,l killer cel. actlvhy. Natural killer cells were acWftp P^erf 

blood of individual donors using and-CDSfi mynwc beads (MUteny. 
IZ I incubated for lib with ''CrWe^'ekac eeUs a, an erTeoa, 

to-aJ ndo of 1:1 in the presence of DilU-P^fs.-Fc or Human IgGl. 

TaS«U death was determined by release qC/'Cr m dtota^jep- 

6,r,'<H.mp(Weatlon 8 na.y S l*f urgieil specimens were provided by 1. Kern 
dung tumours) and P. Q^ettSa tumours ). Gcvonuc DNA w* icmatd 
Sen) and the cop^n was detern.ir.cd usir-g Hoeetut dye «K 
LLladonuuo.o^ 

PCRiod Sou&erhhybridhadon data fo, tr* Mye and HER-2 <><>"«« ^* 
ndf sho'wnV Gene-specinc primers and fluorogenk probes were ics.gned on 
W oKhe ^ of DcR3 or - : nearby ^ ^ ~ « £ 



the iniual clone loau aoi sncuj. .P3^ s T " 

Fcf-slor, protein, (.mmunoadh.slno). The entire : D *3 sconce or jhe 
^domain of Fas ot TNFR1. was fused to the hinge&nd Ftrepon of human 
"gi, ^pressed in inject SF9 cells or in huma^ «*. and punfied as 

^^ee-aeth-ated ee.l to^^^ 

. . . _ r-cr fYv**^«\ w rktiRKS vector or pKJW 



ecus using caldumphosphateor-Hfe^^ 
eroding full-length human F*<(J H0. »6«h« with pRKS eneod w6 CnnA 
£b> » prevent ceU de*h: Afcei I6h. the cells -ere ^abated voth 
biocSylated DCR3-FC or TSFRl-f e and then with P^co^nn-con^ted 

cT^r-traisfected cells by DdU-f c « the,. ceU, express hrdc f»Ud.« 
not showrlj;;it ii possible thai DcR3 recogniied some other factor that « 

^Xreeip iW ,on. Human 2 9 5 cells were peered - *JJjg 
metaboUcally labelled with ("Sleysteine and [»S methionme .5mCu 
Amersham). After !6h of culture in the presence^ of r-V 
the medium was immunoprecipitated with DcH-Fe. ^s-Fc or TWRi-Fc 
(3 rt ). foUowed by protein A-Sepharose (Repligen) The V"W<£*™ 
esolvcd by SDS-PAGE and visualiied on a phosphonmag" (Fu,. BAS2000)^ 
AltemaJiy. purified. Flag-ugecd .olubW FasL (1 ^ -» 
with each Fc-fusion protein (1 rt ). precipitated wtth protem 
solved by SDS-PAGE and visualized by immunoblotang -.th rabbet anu- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (2S m> -» 
fncuSTd with b«L or with DdU-Fc (40 , E ) for 1.5 h at 24'C The reacuon 

with PBS 0.6-ml Actions were coUected. The presence of DcPO-Fc-EasL 

wdls precoated with and-human IgC (Boehringer) to °?™J>™^ 

Sptavidin-hoxserad.sh peroxidase (Amersham). Calibrate of the col otnn 
ica,eda n appareatrda a v e molecularm,s S ofthecomplexof«OK(daranot 

to two soluble FasL ho motrimeis. .. „, 

equilibrium binding analysis. Mkrotiue welU were coated ««h ant-human 



ivaflible marker wtuch maps to -™ 7" : 

markers chat span chromosome 20. The DrfU-spo-tfic pruner seances were 
CTTCTTCG CG CACG CTG - 3 ' andS'-ATCACGCCGGCACCrXG-S and the 
LSSSSi seance w,s S'-(fAM-ACACGAlGCGTGCTCCAACCAG 
AAp-(TAMARA). where FAM is 5 '-fluorescein phosphors 
Z-copy numbers w«« derived u.ing dse for^la . where ACT « the 
difierence in amplification cyd« required to deiea I?e R 3 .c penpheral blood 
lymphocyte DMA compared to test DNA. 



K=Sr?.^^^^^^^^■-- , *'* A '" ,l 

W^l D. Pl-U d«th «nd,r ctnirof. N^M 111- ' ( ^ ^ 
ill, *72-»7S(i993), 
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Crystal structure of the 
ATP-binding subunit 
of an ABC transporter 

U-Wef Hung*, iris Xiaoyan WangT, Kishiko Nikaidot, 
Pet-Qi Uut, Giovanna Ferro-Luzzi Amest & Sung-Hou Kim*? 

* £ O. Lawrence Berkeley National laboratory, t Department of Molecular and 
Cell Biology, and % Department cf Chemistry, University of California at Berkeley, 
Berkeley, California $4720, USA 

ABC transporters (also known as traffic ATPases) form a large „ 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotcs*. The recently completed Escherichia coli genuine** 
sequence revealed that the largest family of paralogbus' E £ <olt 
proteins is composed of ABC transporters 1 - Man^e^L^iryatic 
proteins of medical significance belong to th^ramilyj 7 such as 
the cystic fibrosis transmembrane conduciinccregulator (CFTR), 
the P-glycoprotein (or multid rug-resistance protein) and the 
heterodimeric transporter associated* ^tn A . antigen processing 
(Tapl-Tap2). Here we report the ci^taEstructure at l.S A resolu- 
tion of His P, the ATP-binding subanit of the histidine permease, 
which is an ABC transporter from Salmonella xyphimuriurru We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HUP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-bpd&g ; domains (NTBDs), which arc highly conserved 
throughout the family, and two transmembrane domains*. In 
prokaxydtes these domains are often separate subunics which arc 
assembled inro a membrane-bound complex; in eukaryoces the 
domains arc generally fused into a single polypeptide chain. The 
periplasmic histidine permease of S. typhimurium and £ coli iJ ' s is a 
well-characterized ABC transporter that is a good model for this 
supcrfamily- It consists of a membrane-bound complex. HisQ MP 2f 
which comprises integral membrane subunits, HisQ and HisM. and 
two copies of HisP, the ATP-binding subunit. HisP. which has 
properties intermediate between those of integral and peripheral 
membrane proteins*, is accessible from both sides of the membrane, 
presumably by its interaction with HisQ and HisM 6 . The two HisP 
subunits form a dimer, as shown by their cooperarivity in ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
activity*, and the formation of a HisP dimcr upon chemical cross- 
linking. Soluble HisP ako forms a dimcr 3 . HisP has been purified 
and characterized in an active soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex*. 

The overall shape of the crysral structure of the HisP monomer is 
that of an T with two thick arms (arm I and arm 11); the ATP- 
binding pocket is near the end of arm I (Fig. 1). A six-stranded p- 
sheet {(33 and PS-P 12) spans both arms of the L, with a domain of a 
o> plus 0-trpe structure (pi % 02, p4-£7 ( ctl and a2) on one side 
(within arm I) and a domain of mostly oc~ helices (o3-qc9) on the 



ARM If 




figure 1 Crystal structure of HisP. a Viow of th<j dlmer along an axis 
perpendicular 10 lis Two-fcld axis. Tne lop snd bottom at The dlmer ere suggested 
w face towards the periplasms and cytoplasmic side-5. respectively (see igxi). 
The thickness of arm II is ebouUSA. cor*paraeie lo that of membrane. a-Haiicus 
9ra shown in orange and p-sheeta in green, b. View along the two-fold axfs of the 
HiaP dlmer. showing che relative displacement of cne monornce not op parent in 
a. The p-strande at the dimer Imerface ara labelled o, v<e*v of one monomer from 
the bonom of arm I, as shown in a, lowarda arm II. showing xhe ATP-binding 
pcck3i, a-c. The protein and the bound ATP are m 'ribt>cn" and 'baii-and-sfck' 
representations, respectively. Key residues discussed In the text are indica'ec" in 
c Thece figures were prepared with MQLSOftlPT 3 ', K amino terminus: C. C 
terminus. 
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HOVEL APPROACH TO QUANTITATIVE ^^^^^ 
REAL-TIME DETECTION: APPLICATION TO THE DETECTION OF GLNB 
AMPLIFICATION IN BREAST CANCER 

iLaboraioired-Oncogeneti^e. Centre Rene Huguemn. St-Cloud, France 



Gene amplification is a common event ,n the £ 
human cancers, and amplified oncogenes have bcc" ^°^ 
have diagnostic, prognostic and therapcut.c ^rcle 

baled on fluorescent TaqMan methodology ^» ^**™T 
ZTZ?/ a oi pritn, 7700 Sequence Oetect«on System) capable 
of mcitr no TnCoreVcence In real-time, wa, used 
£ne Amplification in tumor DNA. tattcm 

fe£-ro^ 

p^^ meaning that va.ues are highly 

itans starting with the same copy number. This greatly 

-^nrovL the orecision of DNA quantification. Moreover, 

££££ SfRdoe not require pSst-PCR sample handing. 

InereC preventing potent*! PCR-product <»W™%£ 

SrSlon: it posslsses a wide dynamic *^ 

lion and results in much faster and higher sample throughput 

Th2 real-time PCR method, was used to develop ^ 

a sfmple and rapid assay for the detection and 

It I! most frequently amplified genes (™vc, 'crrfl and 

IroBZ) in Sreast tumors. Extra copies of myc ccndl a* I erbBZ 

were observed in 10. 23 and 15%. respectively, of ^108 1 breast- 

vunor DNA; the largest observed numbers of gene copies 

broad applications In clinical and research setfngs. Int. 1 
Cancer 78:661-666, 1998. 

O /9P5 miey-Llss. Inc. 



Gene amplification plays an important role in the 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a setabve 
advantage. The first technique used to detect genomic "^auoa 
was cytogenetic analysis. Amplification of 
rc S ions. visualised cither as extrachromosomal double routes 
(dmins) or as integrated homogeneously staining «pow C«SRaJ. 
arc among the main visible cytogenetic abnormality a 1 breast 
rumors. 0*er techniques such as comparative genomic 
ST(CGH) (Kallioniemi « al. 1 994) have also been used ,n broad 
searches for regions of increased DNA copy numbers m rumor 
ceUs, and have Revealed some 20 amplified chromosome regmru » m 
breast tumors. Positional cloning efforts are underway to .denufy 
"epical gene(s) in each amplified region. To date Senes known 
to be amplified frequently in breast cancers include W^J 
ccndl (Uql3). and er*B2 ( 1 7ql2-q21) (for rcv.ew, sec B.eche and 
Lidereau, 1995). 

Amplification of the myc. ccndl. and er6B2 proic-oncogenes 
should have clinical relevance in breast cancer since i"^"*" 
Studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis^ perns et al 992 
Schuuring et al.. 1992; Slamon et ai. 1987). Muss etaL (1994) 
SSed that these gene alterations may also be useful for the 
PSandasscssrlentofthccfficacyofadjuvant chemotherapy 

and hormone therapy. . 

However, published results diverge both m terms fie- 
oucney of these alterations and their cluneal value For mstanee, 
0S0 radies in 10 years have failed to rttoWc me controversy 



surrounding the link suggested by Slamon etaL 09j£ ber»een 
erbm amplification and disease propesswn These discrepancies 
« pardydue to the clinical, histological and etluuc h^rogeneiry 
K cancer, but technical considerations are also probably 
involved. ... 

Specific senes (DNA) were initially auanuficd m rumor cells by 
JaK bloning procedures such as Southern and dot blowing 
These batch techniques require large amounts of DNA . (5-10 
u^reaetion) to yield reliable quantitative results. Furthermore; 
fil l care is required at alt stages ol the procedures » 
™S blots of sufficient quality for reliable dosage analysis 
Recently PCR has proven to be a powerful tool for quanaianvc 
DNA » especially with minima. J^ft""™ 
samples (small, early-stage tumors and formal. n-f.xed. paraffin 
embedded tissues). . 

Ouantiutivc PCR can be performed by evaluating the amount of 
product X after a given number of cycles (end-po.nt qwitua- 
rive PCR) or after a varying number of cycles dunng the 
exponS Phase (kineticTu*ntitative PCR) In the first case an 
mSstandard distinct from the target molecule is required to 
So PCR efficiency. The method is relatively easy bu» : implies 
senentinB. quantifying and storing an internal standard for each 
SrZied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quarifying a new gene, since no internal standard is <e£«ri (» 
ex^mal standard curve is sufficient). Moreover, the kinetic method 
STKy^mic range (at least 5 orders of '™S«tudc), = 
aT accurate value for samples differing in their copy number. 
UafoZately. the method is cumbersome and has |ftercfe«. been 
rarely used: It involves aliquot sampling of each | assay jm « 
reauUr intervals and quantifying, for each idwuot, the 

novel approach using fluorescent TaqMan methodology and a new 
Mi (ABI Prism 7700 Sequence Detection System) capable 
oTmcasuring fiuorescence in real time (Gibson a al 1996; He.d « 
al ^SnS TaqMan reacrion is based on the 5' nuclease assay 
L descriS by Holland « ai (1991). The lairer uses the 5 
SlcSctivity of Taq polymerase to cleave a ^«^^ 
oligonucleotide probe during the extension phase of PCR. The 
app S S uses dial-labeled fluorofienic ^™™£*«P5 
aaL 1993). One fluorescent dye. co-vajcntly Unked to th^ end 
of the oligonucleotide, serves as a reporter V^.^l^^L 
fluorescein)] and its emission spectrum is quenched by a second 
Z> Seen" dye. TAMRA {.i.e.. 6-carboxy-telramemyl-rhodarruno) 
attached to the 3' end. Dunns the extension phase of the PCR 
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cycle, the fluorescent hybridization probe U fcgjj**^ 

The fluoresce ^ « «™^£ffita^.r. 

i«^«f^ f ^ 0I S^i^xJx-rtwdami«) included in 
reference dye («.*.. ROX, 6-c»™oxy ^ iu (normalized 

TaqMan buffer, to otain « f « 0 /^f^ c ^ f a S ™ce detector 
reporter) for a given rcacuon tube. The » ^ 

enables the fluorescence spectra of all 9 * °; ifi ^ do - 
eytler to be measured continuously during PCB. 
"The real-time PCR method offers several 
cunSm qualitative PCR method: > (CeU "^^fi £ r 

measured when ampiir.c4u.ji hy r & a more 

product accumulation. This .s the main 

can have a drasuc ettcci ontne«w r f 
samples. 



MATERIA AND METHODS 
JUmor and blood samples 
Samoles were obtained from 108 primary brc«t tumors removed 

patients had undergone radiotherapy « < *™W,- u< , uid 
Lly after surgery, the rumor ^J^^na"^ 
nitrocen until extraction of high-molecular-wcvgrn ^\ N 
included in this study if «X P «l"SieS5 
^/B^- £ " is of the U 

PS DNA was extracted from tumor t^ue «d blood leukocytes 
according to standard methods. 



nf extensive degradation) are both difficult to assess. Wo therefore 
SZZSSfLx* ienc (alb) mapping « (I>M« « 
2l5l3 in which no e^tic altcnmons have boa found hi 
Si'ramor DNAby means of CGH (Kallioo.cm. « at. 1994). 
-rw the ratio of the copy number of the target gene to the copy 

termed "N'\ and is determined as follows: 



after a fi^ed number oi cycles. The h.gher the .tarongc py 

of the genomic DNA target, the earlier V'^Sid cyck) * 
fluoresce is observed. The ^^S.^'ffStS-U 
defined as the fractional cycle number at above 
generated by cleavage of the probe pas.es a r ^^, cs is 



coov number of target gene japp. myc. ccndl. er&BZ) 
N = copy number of reference gene 

P,Wr< D ro6« reference human genomic DM and PCR 
C0 Sef^ 

SsfPerkin-Elmer Applied Biosystc.ns. f^' , 

Jrimers were purchased from ^Agency (Malvern, PA) and 
nrobes from Perkin-Elmcr Applied Biosystems. 
' iSlSddc sequences for the oligonucleotide hybndoaaon 
orobes and primers are available on request. 

The TaoMan VCR Core reagent kit. MicroAmp optical tubes. 
The TaqMan rv_n v, ore v ^ ' . . — Applied Biosystems. 
and MicroAmp caps were from Penan-c4mcrn. Pl »«. / 

standard cuw. The lancr was ct^uuw 

ofPCRproductsnngmsfrom lO ^ ^ ^ 

10-ia noi copies). This senes of diluted p™« 
sliquotcd and stored at -S0°C until use. 

The standard curve was validated by analyzing 2 known 
^JStiSrfSSbS -uman genomic DMA (20 ng and 50 ng)- 
PCA d^tfe'rio* Amplification mixes (50 pi) con«a»ne<l the 

A^pSe tSSS^yh- (UNOJ.3W nM each and 
JoOnM probe. The thermal cycling ^ S 

in-r nnd lOmin at 9S°C. Thermal cycling consisted of 40 
S-cSf 1 rS "«?C for 1 min- Each assay included: a standard 
VL mi ,„ in^ cooies) in duplicate, a no-template control, 

samples. 
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Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RSSULTS 

To validate the method, real-time PGR was performed on 
genomic DNA extracted from 103 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erbB2 proto-oncogenes, 
and the (3-amyloid precursor protein gene (app\ which maps to a 
chromosome region {I I q2 1 ,2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi e( 1994). The 
reference disomic gene was die albumin gene (a/6, chromosome 
4qU-qU>. 



Validation of the standard curve and dynamic range 
ofrtal-tlme PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ngfyd. It should be noted chat the 5 primer pairs chosen to analyse 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown), figure I shows the real-rime PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing a* few as 10 1 copies or as 
many as 10 s copies. 

Copy-number ratio of (he 2 reference genes fzpp and albj 

The app to alb copy-number ratio was determined in IS normal 
leukocyte DNA samples and all 103 primary breast-tumor DNA 
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Ficua£ 1 - Albumin (alb) gene dosage by real-time PCR. Top: Amplification plots for reactions with starting alb gent copy number ranging 
from 10 3 (A9), 10* (A7), 10° (A4) to 10* (A2) and a no-tempUte control (Al). Cycic number is plotted vs. change in jtorroalized reporter signal 
(ARn). For ctveh reaction tube, the fluorescence signal of the reporter dye (FAM) is divided by the fluorescence signal of the passive reference dye 
(R.OX). to obtain a ratio defined as the normalized reporter signal (Rn). &Rn represents the normalized reporter signal (Rn) minus the baseline 
signal established in the first 15 PCR cycles. increases during PCR as a/6 PCR product copy number increases unli] the reaction readies a 
plateau, C, (threshold cycle) represents the fractional cycle number at which a significant increase in Rn above a baseline signal (horizontal black 
line) can first be detected. Two replicate plots were performed for each standard sample, but Ihe data for only one ore shown here. Bottom: 
Standard curve plotting log starring copy number vs. Q (threshold cycle). The black dots represent the data lot standard samples plotted in 
duplicate and the red dots the data foe unknown genomic DNA samples plotted in triplicate. The standard curve sho*s 4 orders of linear dynamic 
range. 
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samples. We seized these Jffi^^i^SSh^ 
chromosome regions (<¥/>• 21qZl.A «»• Hq, '^ c J. h b 
obvious generic changes (including gains « 'oss«)l J»* 
observed in breast cancers (K*U.onicrm «f at 1994). The ratio for 
normal leukocyte DNA.sampIcs fell between ^ «dU 
/„.-,„ i 02 + 021), and was similar for the 108 pnmary breast 
S nNAsamVles (0 6 to 1.6, mean 1.06 ± 0.25). confirming 
ZrT^lim^ reference ******* £ 
toast-tumor D*A. The low range of the ratios also confirmed Mh» 
Ac nucleotide sequences chosen for the primers and probes were 
^potymoihie,! mismatches of their primers or probes with the 
SSrtWA would have resulted in differential amplication. 

my<: ccndl and erb52 gene dose in normal leukocyte DM 

To determine the cut-off point for gene amplification in breast- 
cancer tissue. 18 normal leukocyte DNA samples w« teste d for 
me gene dose (N), calculated as described m Material and 
Methods". The N value of these * l £ 

/T«.V, 0 84 - 0 221 for nrvc 0.7 to 1.6 (mean 1.06 = 0.23) for 
Slo.6to? 3 £»% S 0..9)for e r6B2 M SinccK values 
fo^rcenrf; and erAB2 in normal leukocyte DNA coraistent y 
fcU between 0.5 and 1.6. values of 2 or more were considered to 
represent gene amplification in rumor DNA. 
myc. ccndl and etbB 2 gene dose in breast-tumor DM 

myc . ccndl and er*B2 gene copy numbers in the 108 pnmary 
breast tumors are reported in Table I. Extra <?P'"?^' 
more frequent (23%. 25/108) than extra copies of eW>B2 (15/. 
167108) and myc (10%. 11/108). and ranged ftom 2 to 18.6 for 
ccndl 2 w IS I for C r6B2. and only 2 to 4.6 for the myc gene. 
Figure 2 «d Table II represent tumors in which the ccndl ^ene ^ 
amolified 16-fold (T145). 6-fold (T133) and iion-amplified (Tl 18). 
ffi genes were never found to be co-amplified in the *«« 
er6B2 and ccndl were co-nmpUfied in only 3 cases, myc and ccndl 
f„ 2 cas« and myc and erbSl in 1 case. This favors ^°*«* 
that gene amplifications are independent events in 
Interestingly, 5 tumon showed a decrease of at least 50% in the 
erM32 copy number (N < 0-5). suggesting that they bore deletions 
of the 17q21 region (the site of e,oB2). No such decrease in copy 
number was observed with the other 2 protc-oncogenes. 
Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southem-blot analysis of myc. ccndl and «r&B2 ^pl^cations 
had previously been done on the same 108 primary breast tumors. A 
Serfcct correlation between the results ?f real-ume PGR and 
Southern blot was obtained for tumo« w,* i h.g i copy number, 
ftj a 5 ). However, there were cases (I myc 6 ccndl and 4 eroHZ) 
\a which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases w,m low extra 
copy numbers (N from 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays arc 
currently limited, but would certainly increase »ra supple suuidard- 
S2 arid method were perfected. Gene amplification status 
h* bcen studied mainly by means of Southern b.ottmg, J bu this 
method is not sensitive enou S h to delect low-level gene amputa- 
tion nor accurate enough to quantify the full range of J^g"™ 
values. SouUiern blotting is also time-consummg. uses rad.oacuve 

TAB! £ 1 - DISTRIBUTION OF AMTLlftCATlON U^M/Ol »fc. 
[IjIaWc^I GENES IN IOB HUMAN BR£ASTTlJMOItS 



Cent 



Amplitotioii Icwl (N) 



<0J 



0.3-1.9 



S5 



myc 0 
ccndl 0 
erbBl 5 (A6V,) 



97 (89.8%) 
33 (76.9"/.) 
87 (80.6%) 



11(10.2%) 0 
17(15.7%) 8(7.4%) 
8 (7.4%) t (7.4%) 



reagents and requires relatively large amounts ol high-quality 
genomic DNA. which mean* it cannot be used rounnely in many 
uZrTtories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
Shies of tumor DNA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a PCR method developed for the 
quantification of gene over-representuoon in rumors. The : mrtho4 
based on real-time analysis of PCR ampUficohon. has severa 
adages over other PCR-bascd quandtaUv* assays such as 
IS quantitative PCR (Celi et at. 1994). First, the real-time 
POT Method is performed in a closed-rubc syitem, avoiding the 
Sof c^arain^ion by amplified products. Rc-ampltficatjon of 
canyoverPCR product in subsequent experiments can Use > be 
pSted by using the enzyme uracil N-glycosylase (UNG) 
(Longo ct al 1990). The second advantage is the simplicity and 
of sample anulysU. since no post-PCR W^jtre 
required. Our results show that the automated method is reliable 
We found it possible to determine, in triplicate, the number of 
eonies^f a rarect e«e in more than 100 tumors per day. Unrd. Ae 

a^ear 8 dynamic range of at least 4 
meanine thai samples do not have to contain <:qual starting amount* 
oTSnA. SisTcUue should therefore be suitable for analyzmg 
formalin-fixed, paraffin-embedded tissues, l ourth. and above al^ 
red-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-pomt 
Sasuremem of the amount of accumula.ed PCRp^duct. todeed 
the ABI Prism 7700 Sequence Detection bystcm enables C» to be 
ca cuU cdwl«n PCR amf Ufication is still in the exponen t«U ^phase 
a^d when none of the reaction components ^ rate-l.mitmg. The 
Itoin-run CV of the C value for calibrator human DNA (5 
Kolicaies) was always below 5%, and the berween-assay predion 
m 5 diSehfruns was always below 10% (data not shown). In 
addidone use of a standard curve is not absolutely necessary 
since the copy number can be determined s.mply by comparing the 
C "ratio of the «^t gene with that of reference genes The remits 
ooSed by *e 2 methods (with and without u Standard curve) are 
similar in our experiments (data no. shown) Moreover, unlike 
competitive quantitative PCR. real-time PCR does not require an 
Sal control (the design and storage of internal controls and me 
validation of their amplification efficiency is laborious). 

The only potential disavantage of i-eal-rime PCR. like all other 
PCR-bascd methods and solid-matrix blotting .techniques (SouA- 
ern blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA ftom^mor cells «°"™»~» 
heterogeneous tissue specimens. Only FISH and immunohisioehem- 
?stwcur?measure alterations on a ceU-by-eell basis (Pooler* « al.. 
996 Slamon « aL 19S9). However. RSH requires expensive 
equipment and trained personnel and .s also .7^™"^ 
Moreover. FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible m me 
future by real-time quantitative RT-PCR. ^^^^^ 
ubject to considerable variations in the hands of differeot ^teams 
owing to alterations of target proteins during the procedure Ae 
Sent primary antibodies and fixation methods used and the 
criteria used to define posmve staining. 

The results of mis study are in agreement wi th those ^reported .n 
the literature. (0 Chromosome regions 4qll-ql3 and 21q21.<l 
whichTear alb and app. respectively) showed no genetic ^r- 
atiow in the breast-cancer samples tidied ^re m keeping wi& 
the results of CGH (Kal.ioniemi ct al.. ^^^X 
amplifications of these 3 oncogenes were "d^^f " 
reported by other teams (Bems et al.. 199?.; Berg et al 1992). >.) 
T^rreque'ncy and degree ofmyc amplification '"f^^ 
DNA series were lower man those of ccndl and er6B2 amplirtca- 
Ln conWng the findings of Borg et al. (1992) and Courts! et aL 
1997) ( v) The maxima If ccndl and c,6B2 °*«-«P™ m *«™ 
were 8-fold and 1 5-fold, also in keepinp. with earlier results (about 
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Cycle 




0 2 4 e 8 10 12 14 16 16 20 22 24 26 28 30 32 34 36 38 *0. faporcw: | f-AM -> | 



Cycle 



Tumor 
I T118 
T133 
T145 



CCND1 



ALB 



C\ Copy number c t Copy number 
27.3 4605 26.5 4365 



23-2 



22.1 



61659 
125892 



25.2 



25.6 



10092 



7762 



Figl'rk 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: T U 8 (EI2 ( C6, black squares), Tl 3 3 (G U . B4, red squares) 
and T145 (A3, C8, blue square?). Given the C t of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experiment Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The rcsulis art shown in Table II. 



30-fold maximum) (Berns et aL 1 992; Borg et ai., 1 992; Couijal ct 
aL 1997), (v) The erbB2 copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
quantitative PCR-bascd assays in terms of die frequency and 
degree of amplification (An et aL 1995; Deng et ctL 1996; Valeron 



et cl, 1996). Our results also correlate well with those recently 
published by Gelmini et at (1997), who used iheTaqMan system to 
measure erfcB2 amplification in a small scries of breast tumors 
(n =a 25), but with an instrument (LS-50B luminescence spectrom- 
eter, PcTkin-Elmer Applied Biosystems) which only allows end- 
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TAStt II - EXAMPLES OF centSt GENE OOSaCE RESULTS 

raoM 3 breast tumors 1 



Tumor 




CCfidi 






cih 




)4awH/atb 


Copy 
numbaf 


Mom 


so 


Com 
number 


Mean 


so 


TU6 


4525 






4223 






1.06 




4605 


*603 


77 


4365 


4325 


89 




4678 






4387 








T133 


59821 






9787 




375 


6.03 




61659 


61100 


nil 


10092 


10137 




61821 






10533 










128563 






7321 






16,34 


125392 


125392 


3448 


7762 


7672 


316 




121722 






7933 









! For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level otccndl gene 
amplification (Nccndi/atb) is determined by dividing the average cendi 
copy number value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccmil jene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real- rime quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(^5-fold). The slightly higher frequency of gene amplification 
(especially ccndl and er6B2) observed by means of real-time 
quantitative PCR as compared with Souihem-bioc analysis may be 
explained by the higher sensitiviry of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, telrasomy or polyeomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et al t 1992; 
Slamonera/., 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erbhl (but not of the other 2 proto- 
oncogenes) in several rumors; cr6B2 is looalcd in a chromosome 
region (I7q2l) reported to contain both deletions and amplifica- 
tions in breast cancer (Bicchc and Lidcrcau, 1995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neopiasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a. short time. It should find a place in routine clinical 
gene dosage. 
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< first sequence: pl.Drcfl^804 (length = 520) 
<second sequence: pl.i^lzman (length = 673) 



APPENDIX B 



<597 matches in an overlap of 598: 99/83 percent similarity 
<gaps in first sequence: 1 (75 residues), gaps in second sequence: 0 
<score: 2895 (Dayhoff PAM 250 matrix, gap penalty =8+4 per residue) 
<endgaps not penalized 



pl.DNA44804 
pi. holtzman 



10 20 30 40 50 60 

MCSRVPLLLPLLLL1^IX5PGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVF 
***************************** 

MCSRVPLLLPLLLLLALGPGVQ 

10 20 30 40 50 60 



pl.DNA44804 
pi. holtzman 



70 80 90 

ENGITMLDASSFAGLPGLQLLDLSQNQIAS 

***************************** 

ENGITMLDAGSFAGLPGLQLLDLSQNQIASLPS'GVFQPLANLSNLDLT^RLHEITN 

70 80 90 ^100^ " ^ 110 " "120 



pl.DNA44804 
pl.holtzman 



100 

. _ LRLPRLLLLDLSHNS 

*************** 

RGLRRLgRLJ^KNRlRHIQPGAFDT^^ 

130 ' 140 ' 150 ' ~160 170 " 180 



pl.DNA44804 
pi .holtzman 



110 120 130 140 150 160 

LI^EPGILDTAWEALRLAG]^LCX2LDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRG 
************************************************************ 

LLALEPGILDTANVEALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRG 
190 200 210 220 230 240 



pl.DNA44804 
pi .holtzman 



170 180 190 200 210 220 

LTRLRLAGNTRIAQLRPEDLAGLAAIjQ 

************************************************************ 

LTRLRl^GNTRIAQLRPEDI^GLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPF 
250 260 270 280 290 300 



pl.DNA44804 
pl.holtzman 



230 240 250 260 270 280 

NCVCPLSWFGPWVRESHVTLASPEETRCHFPPKNAGRLLLELDYADFGCPATTTTATVPT 
************************************************************ 

NCVCPLSWFGPWVRESHWLASPEETRCHFPPKNAGRLLLELDYADFGCPATTTTATVPT 
310 320 330 340 350 360 



pl.DNA44804 
pi .holtzman 



290 300 310 320 330 340 

TRPVWEPTALSSSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLNGGTC 
************************************************************ 

TRPVWEPTALSSSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLiNGGTC 
370 380 390 400 410 420 



pl.DNA44804 
pi .holtzman 



350 360 370 380 390 400 

H^TRHHLACLCPEGFTGLYCESQMGQGTRPSPTPVTPRPPRSLTIiGIEPVSPTSLRVGL 
************************************************************ 

HLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTPVTPRPPRSLTLGIEPVSPTSLRVGL 
430 440 450 460 470 480 



pl.DNA44804 
pi .holtzman 



410 420 430 440 450 460 

QRYLQGSSVQLRSLRLTYRKLSGPDKRLVTLRLPASI^EYTVTQLRPNATYSVCVMPLiGP 
************************************************************ 

QRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASI^EYTVTQLRPNATYSVCVMPLGP 
490 500 510 520 530 540 



470 



480 



490 



500 



510 



520 



pi . DNA44 8 04 GRVPEGF" ^EEAHTPPAVHSNHAPVTQAREGNLP T PALAAVLLAALAAVGAAYCVR 
* ******* .^P£************************ • ********************* 

pi holtzman GRVPEGEEACGELAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAVLLAALAAVGAAYCVR 

550 560 570 580 590 600 

530 540 550 560 570 580 

pi . DNA44 804 RGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPG 
************************************************************ 
pi . holtzman RGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPG 

610 620 630 640 650 660 

590 

p 1 . DNA4 4804 PGLQS PLHAKPYI 
************* 

pi . holtzman PGLQS PLHAKPYI 

670 



Sequence file : /home/j r^Wva/Molbio/carpenda/temp ." ie/pl .holtzman 
motifs in /usr/local/se^y libdata/motif .pro 

Motif name: N-glycosylat ion site. 
Accession: PS00001; 
Motif: N[!P] [ST] [ ! P] 

101 NLSN 

117 NETF 

273 NLSL 

500 NLSG 

528 NATY 



Sequence file: /home/i ^^va/Molbio/carpenda/temp . ie/pl.DNA44804 
motifs in /usr/local/seq/libdata/motif .pro 

Motif name: N-glycosylation site. 
Accession: PS00001; 
Motif: N[!P] [ST] [ ! P] 

198 NLSL 
425 NLSG 
453 NATY 



HMM file: 
Sequence file: 



/usr/seqdb/pfam/Pfam_ls 
pl.DNA44804 



Query: DNA44804 [598 aa] 



Model Description 

LRR Leucine Rich Repeat 

LRRCT Leucine rich repeat C- terminal domain 

EGF EGF-like domain 

LRRNT Leucine rich repeat N- terminal domain 

fn3 Fibronectin type III domain 



Score 


E 


-value 


N 


59.2 


8 


.8e-14 


7 


47.1 




4e-10 


1 


30.0 


5 


.4e-05 


1 


29.8 


6 


.5e-05 


1 


13.0 




0. 15 


1 



Parsed 
Model 


for domains: 
Domain seq-f 


seq-t 


hmm-f 


hmm-t 


score 


E -value 


LRRNT 


1/1 


23 


51 . 


1 


31 [] 


29.8 


6.5e-05 


LRR 


1/7 


53 


76 . 


1 


25 [] 


5.7 


2.1e+02 


LRR 


2/7 


77 


102 . 


1 


25 [] 


9.4 


65 


LRR 


3/7 


118 


141 . 


1 


25 [] 


10.4 


44 


LRR 


4/7 


142 


164 . 


1 


25 [] 


19.1 


0.1 


LRR 


5/7 


165 


189 . 


1 


25 [] 


11.1 


26 


LRR 


6/7 


190 


212 . 


1 


25 [] 


12.3 


12 


LRRCT 


1/1 


223 


275 . 


1 


54 [] 


47.1 


4e-10 


EGF 


1/1 


334 


366 . 


1 


45 [] 


30.0 


5.4e-05 


LRR 


7/7 


415 


437 . 


1 


25 [] 


3.1 


4.8e+02 


fn3 


1/1 


383 


474 . 


1 


84 [] 


13.0 


0.15 



HMM file: 
Sequence file: 



/usr/seqdb/pfam/Pf am_ls 
pl.holtzman 



Query: holtzman [673 aa] 

Scores for sequence family classification (scoi 

Model Description 

LRR Leucine Rich Repeat 

LRRCT Leucine rich repeat C-terminal domain 

EGF EGF- like domain 

LRRNT Leucine rich repeat N- terminal domain 

fn3 Fibronectin type III domain 



Score 


E 


-value 


N 


108.8 




le-28 


11 


47.1 




4e-10 


1 


30.0 


5 


.4e-05 


1 


29.8 


6 


,5e-05 


1 


13.0 




0.15 


1 



Parsed 
Model 


for domains : 
Domain seq-f 


seq-fc 


nnufl - £ 


lillllli C 




R— value 

L-J V Ca. J_ — 


LRRNT 


1/1 


23 


51 . 


1 


31 [] 


29.8 


6.5e-05 


LRR 


1/11 


53 


76 . 


. 1 


25 [] 


6.1 


1. 9e+02 


LRR 


2/11 


77 


100 . 


1 


25 [] 


21.6 


0.019 


LRR 


3/11 


101 


124 . 


1 


25 [] 


15.6 


1.2 


LRR 


4/11 


125 


148 . 


1 


25 [] 


18.1 


0.21 


LRR 


5/11 


149 


169 . 


1 


25 [] 


9.7 


58 


LRR 


6/11 


170 


192 . 


1 


25 [] 


6.1 


1. 8e+02 


LRR 


7/11 


193 


216 . 


1 


25 [] 


10.4 


44 


LRR 


8/11 


217 


239 . 


1 


25 [] 


19.1 


0.1 


LRR 


9/11 


240 


264 . 


1 


25 [] 


11.1 


26 


LRR 


10/11 


265 


287 . 


1 


25 [] 


12.3 


12 


LRRCT 


1/1 


298 


350 . 


1 


54 [] 


47.1 


4e-10 


EGF 


1/1 


409 


441 . 


1 


45 [] 


30.0 


5.4e-05 


LRR 


11/11 


490 


512 . 


1 


25 [] 


3.1 


4 . 8e+02 


fn3 


1/1 


458 


549 . 


1 


84 [] 


13.0 


0.15 
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Id proteins antagonize basic helix-loop-helix pro- 
teins, inhibit differentiation, and enhance cell prolif- 
eration. In this study we compared the expression of 
Id-1, Id-2, and Id-3 in the normal pancreas, in pan- 
creatic cancer, and in chronic pancreatitis (CP). 
Northern blot analysis demonstrated that all three Id 
mRNA species were expressed at high levels in pan- 
creatic cancer samples by comparison with normal or 
CP samples. Pancreatic cancer cell lines frequently 
coexpressed all three Ids, exhibiting a good correla- 
tion between Id mRNA and protein levels, as deter- 
mined by immunoblotting with highly specific anti-Id 
antibodies. Immunohistochemistry using these anti- 
bodies demonstrated the presence of faint Id-1 and 
Id-2 immunostaining in pancreatic ductal cells in the 
normal pancreas, whereas Id-3 immunoreactivity 
ranged from weak to strong. In the cancer tissues, 
many of the cancer cells exhibited abundant Id-1, 
Id-2 , and Id-3 immunoreactivity. Scoring on the basis 
of percentage of positive cells and intensity of immu- 
nostaining indicated that Id-1 and Id-2 were increased 
significantly in the cancer cells by comparison with 
the respective controls. Mild to moderate Id immuno- 
reactivity was also seen in the ductal cells in the 
CP-like areas adjacent to these cells and in the ductal 
cells of small and interlobular ducts in CP. In con- 
trast, in dysplastic and atypical papillary ducts in CP, 
Id-1 and Id-2 immunoreactivity was as significantly 
elevated as in the cancer cells. These findings suggest 
that increased Id expression may be associated with 
enhanced proliferative potential of pancreatic cancer 
cells and of proliferating or dysplastic ductal cells in 
CP. (Am J Pathol 1999, 155:815-822) 



Basic helix-loop-helix (bHLH) proteins play an important 
roie as transcription factors in cellular development, pro- 
liferation, and differentiation. 1,2 The basic domain of the 
bHLHs is required for binding to an E-box DNA se- 
quence, thus promoting transcription of specific target 
genes. The HLH domain promotes dimer formation with 
various members of the bHLH protein family. 1,2 Ho- 
modimers of the class B family of bHLH proteins, includ- 
ing MyoD, NeuroD, and numerous other proteins, are 
known to activate tissue-specific genes 3-5 These tissue- 
specific bHLHs typically form heterodimers with widely 
expressed class A bHLHs, which include proteins en- 
coded by E2A, E2-2, HEB, and other genes (also termed 
E-proteins). 6-9 These heterodimers activate transcription 
of genes that are associated with differentiation. 

Id genes encode a family of four HLH proteins that tack 
the basic DNA binding domain. 1,10 They act as dominant- 
negative HLH proteins by forming high affinity het- 
erodimers with other bHLH proteins, thereby preventing 
them from binding to DNA and inhibiting transcription of 
differentiation-associated genes. 10 " 12 Id gene expres- 
sion is down-regulated on differentiation in many cell 
types in vitro and in vivo. 13 ~ 18 In addition, Id proteins seem 
to be required for cell cycle progression through G^S 
phase in certain cell types, and interaction between Id-2 
and pRB is associated with enhanced proliferation in 
some cell lines in vitro. 19-23 

Pancreatic cancer is the fifth leading cause of cancer 
death in the United States, with a mortality rate that vir- 
tually equals its incidence rate. 24 This malignancy is often 
associated with the overexpression of a variety of mito- 
genic growth factors and their receptors, and by onco- 
genic mutations of K-ras and inactivation of the p53 tumor 
suppressor gene 25 We have recently reported that pan- 
creatic cancers overexpress the HLH protein Id-2, and 
that enhanced expression of this protein is evident in the 
cytoplasm of the cancer cells within the pancreatic tumor 
mass. 26 It is not known, however, whether the expression 
of other Id proteins is altered in this malignancy, or 
whether their expression is altered in chronic pancreatitis 
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(CP), an inflammatory disease that is characterized by 
dysplastic ducts, foci of proliferating ductal cells, acinar 
ceil degeneration, and fibrosis 27 We now report that 
there is a five- to sixfold increase in ld-1 and ld-2 mRNA 
levels and a twofold increase in ld-3 mRNA levels in 
pancreatic cancer by comparison with the normal pan- 
creas. In contrast, overall Id mRNA levels are not in- 
creased in CP. 



Patients and Methods 

Normal human pancreatic tissue samples from 7 male 
and 5 female donors (median age 41.8 years, range 
14-68 years), CP tissues from 13 males and 1 female 
(median age 42.1 years; range 30-56 years), and pan- 
creatic cancer tissues from 1 0 male and 6 female donors 
(median age 62.6 years; range 53-83 years) were ob- 
tained through an organ donor program and from surgi- 
cal specimens from patients with severe symptomatic 
chronic pancreatitis or pancreatic cancer. A partial 
duodenopancreatectomy (Whipple/pylorus-preserving 
Whipple; n - 13), a left resection of the pancreas (n = 2), 
or a total pancreatectomy (n = 1 ) were carried out in the 
pancreatic cancer patients. According to the TNM clas- 
sification of the Union Internationale Contre le Cancer 
(UICC) 6 tumors were stage 1 , 1 was stage 2, and 9 were 
stage 3 ductal cell adenocarcinoma. Freshly removed 
tissue samples were fixed in 10% formaldehyde solution 
for 12 to 24 hours and paraffin-embedded for histological 
analysis. In addition, tissue samples were frozen in liquid 
nitrogen immediately on surgical removal and maintained in 
-80°C until use for RNA extraction. All studies were ap- 
proved by the Ethics Committee of the University of Bern, 
Bern, Switzerland, and by the Human Subjects Committee 
at the University of California, Irvine, California. 

Northern Blot Analysis 

Northern blot analysis was carried out as described pre- 
viously. 26,28 Briefly, total RNA was extracted by the single 
step acid guanidinium thiocyanate phenol chloroform 
method. RNA was size-fractionated on 1 .2% agarose/1.8 
mol/L formaldehyde gels, electrotransferred onto nylon 
membranes, and cross-linked by UV irradiation. Blots 
were prehybridized and hybridized with cDNA probes 
and washed under high stringency conditions. The fol- 
lowing cDNA probes were used: a 979-bp human ld-1 
cDNA probe, a 440-bp human ld-2 cDNA probe, and a 
450-bp human ld-3 cDNA probe, covering the entire 
coding regions of ld-1, ld-2, and ld-3, respectively. A 
BamHI 190-bp fragment of mouse 7S cDNA that hybrid- 
izes with human cytoplasmic RNA was used to confirm 
equal RNA loading and transfer. Blots were then exposed 
at -80°C to Kodak BioMax-MS films and the resulting 
autoradiographs were scanned to quantify the intensity of 
the radiographic bands 26 28 For each sample the ratio of 
Id mRNA expression to 7S expression was calculated. To 
compare the relative increase in expression of the re- 
spective Id mRNA species in the cancer and CP sam- 
ples, the same normal samples were used for normal/ 
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Figure 1. mRNA expression of ld-1, ld-2, and ld-3 in pancreatic cancer and 
chronic pancreatitis. Total RNA (20 fig/lane) from six normal, eight cancer- 
ous, and seven chronic pancreatitis tissue samples were subjected to North- 
ern blot analysis using - 12 P-labeled cDNA probes (500,000 cpm/ml) specific 
for ld-1, ld-2, and ld-3, respectively. A 7S cDNA probe (50,000 cpm/ml) was 
used as a loading and transfer control. Exposure times of the normal/cancer 
blots were 1 day for all Id probes, and 2 days for the normal/CP blots. 
Exposure time was 4 hours for mouse 7S cDNA. By comparison with die 
normal samples, ld-1 and ld-3 mRNA levels were elevated in 8 and 9 cancer 
samples, respectively, whereas Id -2 was elevated in 6 cancer samples. 

cancer and normal/CP membranes. The median score for 
ld-1, ld-2, and ld-3 mRNA levels in these normal samples 
was set to 100. Statistical analysis was performed with 
SigmaStat software (Jandel Scientific, San Raphael, CA). 
The rank sum test was used, and P < 0.05 was taken as 
the level of significance. 

Cell Culture and Western Blot Analysis 

PANC-1. MIA-PaCa-2, ASPC-1, and CAPAN-1 human 
pancreatic cell lines were obtained from ATCC (Manas- 
sas, VA). COLO-357 human pancreatic cells were a gift 
from Dr. R. S. Metzger (Durham, NC). Cells were routinely 
grown in DMEM (COLO-357, MIA-PaCa-2, PANC-1) or 
RPMI (ASPC-1, CAPAN-1) supplemented with 10% fetal 
bovine serum, 100 U/ml penicillin, and 100 iigfm\ strep- 
tomycin. For immunoblot analysis, exponentially growing 
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Figure 2. Densitometry analysis of Northern blots. Autoradiographs of 
Northern blots from 12 normal, 14 CP, and 16 pancreatic cancers were 
analyzed by densitometry. mRNA levels were determined by calculating the 
ratio of the optical density for the respective Id mRNA species in relation to 
the optical density of mouse 7S cDNA. To compare the relative increase in 
expression of the respective Id mRNA species in the cancer and CP samples, 
the same normal samples were used for normal/cancer and normal/CP 
membranes. Normal pancreatic tissues are indicated by circles, CP tissues by 
triangles, and cancer tissues by squares. Data are expressed as median 
scores ± SD. By comparison with the normal samples, only the cancer 
samples exhibited significant increases: 6.5-fold (P< 0.01) for Id-1 T fivefold 
(P < 0.01) for Id-2, and twofold (P = 0.027) for Id-3. 



cells (60-70% confluent) were solubilized in lysis buffer 
containing 50 mmol/L Tris-HCI, pH 7.4, 150 mmol/L NaCI, 
1 mmol/L EDTA, 1 ptg/ml pepstatin A, 1 mmol/L phenyl- 
methylsulfonyl fluoride (PMSF), and 1% Triton X-100. Pro- 
teins were subjected to sodium dodecyl sulfate polyacryl- 
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Figure 3. Id mRNA and protein expression in pancreatic cancer cell lines. 
Upper panels: Total RNA (20 /ig/lane) from 5 pancreatic cancer cell lines 
were subjected to Northern blot analysis using 32 P-labeled cDNA probes' 
(500,000 cpm/ml) specific for ld-1, ld-2, and Id-3, respectively. Exposure 
times were 1 day for all Id probes. Lower panels: Immunoblotting. Cell 
lysates (30 /ig/lane) were subjected to SDS-PAGE. Membranes were probed 
with specific ld-1, ld-2, and Id-3 antibodies. Visualization was performed by- 
enhanced chemiluminescence. 



amide gel electrophoresis (SDS-PAGE), transferred to 
Immobilon P membranes, and incubated for 90 minutes 
with the indicated antibodies and for 60 minutes with 
secondary antibodies against rabbit IgG. Visualization 
was performed by enhanced chemiluminescence. 



Immunohistochemistry 

Specific rabbit anti-human ld-1 (C-20), ld-2 (C-20), and 
Id-3 (C-20; all from Santa Cruz Biotechnology, Santa 
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Figure 4. Normal and cancerous pancreatic tissues were subjected to immu- 
nostaining using higlily specific anti-Id- 1 (A-C), anti-Id-2 (D-F), and anti-Id-3 
(G-l) antibodies as described in die Methods section. Moderate to strong Id-1 
immunoreactiviry was present in the cytoplasm of duct-like cancer cells (A 
and C, left panel). In the normal pancreas there was weak Id-1 immunore- 
activiry in the ductal cells (B). Preabsorption with the Id-1 -specific blocking 
peptide abolished the Id-1 immunoreactivity (C, right panel). Strong Id-2 
immunoreactivity was observed in the cytoplasm of the cancer cells that 
exhibited duct-like structures (D and F, left panel), whereas in the normal 
pancreas, there was only weak Id-2 immunoreactivity in the ductal cells (E). 
Preabsorption with the Id-2-specific blocking peptide alwlished the Id-2 
immunoreactivity (F, right panel). Moderate to strong Id-3 immunoreactivity 
was present in the duct-like cancer cells (G and I, left panel). Moderate to 
strong Id-3 immunoreactivity was also present in the ductal cells of normal 
pancreatic tissue samples (H). Id-3 immunoreactivity was completely abol- 
ished by preabsorption with the Id-3 specific blocking peptide (I, right 
panel). A, D, and G constitute serial sections of a pancreatic cancer sample, 
revealing coexpression of the three Id proteins. Scale bars, 25 jx m - 

Cruz, CA) polyclonal antibodies were used for immunhis- 
tochemistry. These affinity-purified rabbit polyclonal anti- 
bodies specifically react with Id-1, Id-2, and Id-3, respec- 
tively, of human origin, as determined by Western 
blotting. Paraffin-embedded sections (4 /xm) were sub- 
jected to immunostaining using the streptavidin-peroxi- 
dase technique. Where indicated, immunostaining for all 
three Id proteins was performed on serial sections. En- 



dogenous peroxidase activity was blocked by incubation 
for 30 minutes with 0.3% hydrogen peroxide in methanol. 
Tissue sections were incubated for 15 minutes (23°C) 
with 10% normal goat serum and then incubated for 16 
hours at 4°C with the indicated antibodies in PBS con- 
taining 1% bovine serum albumin. Bound antibodies 
were detected with biotinylated goat anti-rabbit IgG sec- 
ondary antibodies and streptavidin-peroxidase complex, 
using diaminobenzidine tetrahydrochloride as the sub- 
strate. Sections were counterstained with Mayer's hema- 
toxylin. Preabsorption with Id-1-, Id-2-, or ld-3-specific 
blocking peptides completely abolished immunoreactiv- 
ity of the respective primary antibody. The immunohisto- 
chemical results were semiquantitatively analyzed as de- 
scribed previously. 29,30 The percentage of positive 
cancer cells was stratified into four groups: 0, no cancer 
cells exhibiting immunoreactivity; 1 , <33% of the cancer 
cells exhibiting immunoreactivity; 2, 33 to 67% of the 
cancer cells exhibiting immunoreactivity; 3 >67% of the 
cancer cells exhibiting immunoreactivity. The intensity of 
the immunohistochemical signal was also stratified into 
four groups: 0, no immunoreactivity; 1 , weak immunore- 
activity; 2, moderate immunoreactivity; 3, strong immu- 
noreactivity. Finally, the sum of the results of the cell 
score and the intensity score was calculated. Statistical 
analysis was performed with SigmaStat software. The 
rank sum test was used, and P < 0.05 was taken as the 
level of significance. 



Results 

Northern blot analysis of total RNA isolated from 12 nor- 
mal pancreatic tissues and 16 pancreatic cancers re- 
vealed the presence of the 1 .2-kb Id-1 transcript and the 
1.6-kb Id2 mRNA transcript in 11 of the 12 normal pan- 
creatic samples, and the 1 .3-kb Id-3 mRNA transcript in 
all normal pancreatic samples (Figure 1A, 2). In the can- 
cer tissues, Id-1 mRNA levels were elevated in 8 of 16 
samples, Id-2 mRNA levels were elevated in 9 of these 
samples, and Id-3 mRNA levels were elevated in 6 of 
these samples (Figure 1A, 2). Concomitant overexpres- 
sion of all three Id species was observed in 6 of the 
cancer samples (38%). In contrast, none of the Id mRNA 
species were overexpressed in CP by comparison with 
norma! controls (Figure 1B, 2). Densitometry analysis of 
all of the autoradiograms indicated that there was a 6.5- 
fold increase (P < 0.01) in Id-1 mRNA levels, a fivefold 
increase (P < 0.01) in Id-2 mRNA levels, and a twofold 
increase (P = 0.027) in Id-3 mRNA levels in the pancre- 
atic cancer samples in comparison to normal controls 
(Figure 2). In contrast, there was no statistically signifi- 
cant difference in the expression levels of Id-1, Id-2, and 
Id-3, in CP tissues in comparison to the corresponding 
levels in the normal pancreas (Figure 2). 

Next, we assessed the expression of the three Id 
genes in 5 human pancreatic cancer cell lines by North- 
ern and Western blot analyses. Id-1 mRNA was present 
at varying levels in all 5 cell lines (Figure 3). ASPC-1, 
CAPAN-1, MIA-PaCa-2, and PANC-1 expressed moder- 
ate to high levels of Id-1 mRNA, whereas COLO-357 cells 
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Figure 5. Immunohistochemistry of pancreatic cancer arid dysplastic ducts in CP tissues. In the pancreatic cancer tissues (A-C) there was moderate to strong Id-1 
(A), ld-2 (B), and Id-3 (C) immunoreactivity in the ductal cells in the areas adjacent to the cancer cells that exhibited CP-like alterations. Islet cells did not exhibit 
Id immunoreactivity (outlined by solid arrowheads). In the CP samples, moderate to strong Id-1 (D), Id-2 (E), and Id-3 (F) immunoreactivity was present in the 
cytoplasm of epithelial cells forming large dysplastic ducts. Scale bar, 25 p.™- 



expressed relatively low levels of this mRNA moiety. 
Western blotting with a highly specific anti-ld-1 antibody 
confirmed the presence of the approximately 14-kd Id-1 
protein in the 4 ceil lines that expressed high levels of 
Id-1 mRNA (Figure 3). Furthermore, the three cell lines 
with the highest Id-1 mRNA expression (CAPAN-1, MIA- 
PaCa-2, and PANC-1) also exhibited the highest Id-1 
protein expression. Variable levels of the 1.6-kb ld-2 
mRNA transcript were present in all 5 cell lines. In addi- 
tion, a minor band of approximately 1.2 kb was visible in 
COLO-357 and MIA-PaCa-2 cells. Immunoblot analysis 
with a highly specific anti-ld-2 antibody revealed two 
bands of approximately 16 and 18 kd at relatively high 
levels in all of the cell tines with exception of PANC-1 
cells, in which the 16-kd band was relatively faint (Figure 
3). With the exception of MIA-PaCa-2 cells, there was a 
good correlation between ld-2 mRNA and protein levels 
(Figure 3). Id-3 mRNA was present at high levels in 
MIA-PaCa-2 cells, at moderate levels in COLO-357 cells, 
and at low levels in PANC-1 cells. Id-3 mRNA was not 
detectable in ASPC-1 and CAPAN-1 cells (Figure 3). 
Immunoblot analysis with a highly specific anti-1d-3 anti- 
body revealed an approximately 14-kd band that was most 
abundant in MIA-PaCa-2 cells, and was also readily appar- 
ent in COLO-357 and PANC-1 cells. In contrast, only a faint 
Id-3 band was seen in ASPC-1 and CAPAN-1 cells. Thus, 
with the exception of PANC-1 cells, there was a good cor- 
relation between Id-3 mRNA and protein levels. 



To determine the localization of Id-1, ld-2, and Id-3, 
immunostaining was carried out using the same highly 
specific anti-Id antibodies. In the pancreatic cancers, 
moderate to strong Id-1 immunoreactivity was present in 
the cancer cells in 9 of 10 randomly selected cancer 
samples. An example of moderate Id-1 immunoreactivity 
is shown in Figure 4A, and of strong immunoreactivity in 
Figure 4C (left panel). In contrast, in the normal pancreas, 
faint Id-1 immunoreactivity was present only in the ductal 
cells of pancreatic ducts (Figure 4B, arrowheads). Pre- 
absorption with the Id-1 -specific blocking peptide com- 
pletely abolished the Id-1 immunoreactivity (Figure 4C, 
right panel). The cancer cells also exhibited strong ld-2 
(Figure 4, D and F, left panel) and moderate to strong Id-3 
immunoreactivity. An example of moderate Id-3 immuno- 
reactivity is shown in Figure 4G, and of strong immuno- 
reactivity in Figure 41 (left panel). In contrast, only faint 
ld-2 immunoreactivity was present in the ductal cells in 
the normal pancreas (Figure 4E), whereas Id-3 immuno- 
reactivity in these cells was more variable and ranged 
from moderate to occasionally strong (Figure 4H). Islet 
cells and acinar cells were always devoid of Id immunore- 
activity. Preabsorption of the respective antibody with the 
blocking peptides specific for ld-2 (Figure 4F, right panel) 
and Id-3 (Figure 41, right panel) completely abolished im- 
munoreactivity. Analysis of serial pancreatic cancer sec- 
tions revealed that there was often colocalization of the 
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Figure 6. [mmunohistocheniistry of atypical papillary epithelium in CP tissues. Serial section analysis of some CP samples revealed the presence of large duct-iike 
structures with atypical papillary epithelium. Mild to moderate ld-1 (A) and Id-2 (B) immunoreactivity and weak Id-3 (C) immunoreactivity was present in the 
cytoplasm of the cells forming these large ducts with papillary structures. Some CP samples also exhibited moderate Id-3 immunoreactivity in these cells (D). Scale 
bar, 25 /xm. 



three Id proteins. An example of serial sections from a 
pancreatic cancer tissue is shown in Figure 4, A, D, and G. 

ld-1, ld-2, and Id-3 immunoreactivity was also present 
at moderate levels in the cytoplasm of ductal cells within 
CP-like areas adjacent to the cancer cells (Figure 5, A-C). 
As in the normal pancreas, islet cells (outlined by arrow- 
heads) did not exhibit Id immunoreactivity. In 4 of 9 CP 
samples, there were foci of ductal cell dysplasia of rela- 
tively large interlobular ducts, all of which exhibited mod- 
erate to strong ld-1, ld-2, and Id-3 immunoreactivity (Fig- 
ure 5, D-F). Five of 9 CP samples also contained foci of 
large ducts exhibiting atypical papillary epithelium. Serial 
section analysis of one of those CP samples revealed 
mild to moderate ld-1 and ld-2 immunoreactivity and 
weak Id-3 immunoreactivity in the cells of these atypical 
papillary ducts (Figure 6, A-C). In contrast, in some of 
these CP samples, moderate to strong Id-3 immunoreac- 
tivity was also observed (Figure 6D). However, most of 
the ductal cells forming the typical ductular structures of 
CP, such as large interlobular ducts and small proliferat- 
ing ducts, exhibited generally only weak to occasionally 
moderate Id immunoreactivity (data not shown). 



The immunohistochemica! data for ld-1, ld-2, and Id-3 
are summarized in Table 1. In the case of ld-1 and ld-2, 
the cancer cells as well as the dysplastic and atypical 
papillary ducts in CP exhibited a significantly higher 
score than the ductal cells in the normal pancreas. In 
contrast, due to the marked variability in Id-3 immuno- 
staining in the normal pancreas, the differences between 
normal and cancer cells and normal and dysplastic cells 
did not achieve statistical significance. 



Discussion 

Id proteins constitute a family of HLH transcription factors 
that are important regulators of cellular differentiation and 
proliferation. 1,2 To date, four members of the human Id 
family have been identified. 1,10 " 12 Their expression is 
enhanced during cellular proliferation and in response to 
mitogenic stimuli, 19,31 and overexpression of Id genes 
inhibits differentiation and/or enhances proliferation in 
several different cell types. 15,32 " 34 The forced expression 
of ld-1 in mouse small intestinal epithelium results in 
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Table 1. Histological Scoring 









ld-1 


ld-2 


ld-3 


Normal {n - 
Cancer (n = 
CP (n = 9) 


6) 
10) 


Ductal cells 
Cancer cells 

Typical CP lesions (n - 9) 
Dysplastic ducts (n = 4) 
Atypical papillary ducts (n = 5) 


2.0 ± 0.4 
4.5* ± 0.5 

2.7 ± 0.5 
5.3* ± 0.2 
4.4* ± 0.2 


2.3 ± 0.2 
5.2 § ± 0.3 

3.1 ± 0.6 
5.8* ± 0.2 
5.2* ± 0.2 


2.5 ± 0.9 
4.5 ± 0.6 
3.4 ± 0.7 
5.3 ± 0.4 
5.0 ± 0.4 



Scoring of the histological specimens was performed as described in the Patients and Methods section. Values are the means ± SD of the number 
of samples indicated in parenthesis. P values are based on comparisons with the respective controls in the normal samples. 
*, P < 0.02; f P < 0.01; *P = 0.004; § P = 0.001. 



adenoma formation in these animals. 35 The growth-pro- 
moting effects of Id genes are thought to occur through 
several mechanisms. For example, ld-2 can bind to mem- 
bers of the pRB tumor suppressor family, thus blocking 
their growth-suppressing activity, 20,21 and ld-1 and ld-2 
can antagonize the bHLH-mediated activation of known 
inhibitors of cell cycle progression such as the cyclin- 
dependent kinase inhibitor p21. 23 

In the present study, we determined by Northern blot 
analysis that a significant percentage of human pancre- 
atic cancers expressed increased ld-1, ld-2, and ld-3 
mRNA levels. Increased expression was most evident for 
ld-1 (6.5-fold) and ld-2 (fivefold). In contrast, ld-3 mRNA 
levels were only twofold increased in the cancer samples, 
partly because this mRNA was present at relatively high 
levels in the normal pancreas. Immunhistochemical anal- 
ysis confirmed the presence of ld-1, ld-2, and ld-3 in the 
cancer cells within the tumor mass, whereas in the normal 
pancreas faint ld-1 and id-2 immunoreactivity and mod- 
erate to occasionally strong ld-3 immunoreactivity was 
present in some ductal cells. Pancreatic acinar and islet 
ceils in the normal pancreas were devoid of ld-1, ld-2, 
and ld-3 immunoreactivity. In the cancer samples, all 
three Id proteins often colocalized in the cancer cells. 
Coexpression of all three Id genes was also observed in 
cultured pancreatic cancer cell lines, which often exhib- 
ited a close correlation between Id mRNA and protein 
expression. However, in MIA-PaCa-2 there was a diver- 
gence of ld-2 mRNA and protein levels, and in PANC-1 
cells, ld-3 mRNA levels did not correlate well with ld-3 
protein expression. These observations suggest that in 
these cells, the half-life of either Id mRNA or Id protein 
may be altered by comparison with the other cell lines. 
Interestingly, ld-2 immunoblotting revealed two closely 
spaced bands of approximately 16 and 18 kd in 4 of 5 
cell lines. In view of the fact that two possible initiation 
codons have been reported for the ld-2 gene, 36 our 
observation raises the possibility that the two ld-2-immu- 
noreactive bands may represent separate translation 
products of the !d-2 gene. 

Pancreatic cancers often harbor p53 tumor suppressor 
gene mutations 37 and exhibit alterations in apoptosis 
pathways. Thus, these cancers often exhibit increased 
expression of anti-apoptotic proteins such as Bcl-2 38 and 
abnormal resistance to Fas-ligand-mediated apopto- 
sis 39 It has been shown recently that forced constitutive 
expression of Id genes together with the expression of 
anti-apoptotic genes such as Bcl-2 or BclX L can result in 



malignant transformation of human fibroblasts, 11 raising 
the possibility that the enhanced Id expression in pan- 
creatic cancers together with increased expression of 
anti-apoptotic genes may contribute to the malignant 
potential of pancreatic cancer cells in vivo. 

In the CP tissues there was no significant increase in 
ld-1, ld-2, and ld-3 mRNA levels in comparison to the 
normal pancreas. Immunohistochemical analysis of pan- 
creatic cancer samples revealed colocalization of weak 
to moderate ld-1, ld-2, and ld-3 immunoreactivity in pro- 
liferating ductal cells in the CP-like regions adjacent to 
the cancer cells, indicating that Id expression was not 
restricted to the cancer cells. Similarly, analysis of CP 
samples indicated weak ld-1, ld-2, and ld-3 immunore- 
activity in the cells of small proliferating ducts and large 
ducts without dysplastic changes. In general, there was a 
correlation between weak immunoreactivity and low Id 
mRNA levels. However, in samples that harbored large 
ducts with papillary structures there was moderate Id 
immunoreactivity, and in the ceils forming dysplastic 
ducts there was moderate to strong Id immunoreactivity. 
In these CP samples, Id mRNA levels were relatively 
higher than in the CP samples that were devoid of these 
histological changes. Overall, however, increased Id ex- 
pression, most notably of ld-1 and ld-2, distinguished a 
subgroup of pancreatic cancers from CP (Table 1). 

Epidemiological studies have shown that the risk of 
developing pancreatic cancer is increased up to 16-fold 
in patients with pre-existing CP in comparison to the 
general population 40 The mechanisms that contribute to 
neoplastic transformation in CP are not known. Although 
there is no established tumor progression model for pan- 
creatic cancer, such as the adenoma-carcinoma se- 
quence of colorectal carcinoma, 41 it is generally ac- 
cepted that K-ras and p16 mutations occur relatively 
early in pancreatic carcinogenesis, whereas p53 muta- 
tions occur late in this process. 37,4 1-43 Increased Id ex- 
pression may contribute to malignant transformation of 
cultured cell lines in w'fro 11 and has been linked to cell 
invasion in a murine mammary epithelial cell line 44 In 
view of the current findings that ld-1, ld-2, and ld-3 are 
overexpressed in pancreatic cancer and in dysplastic/ 
metaplastic ducts in CP, these observations raise the 
possibility that elevated levels of ld-1, ld-2, and, to a 
lesser extent, ld-3 may represent relatively early markers 
of pancreatic malignant transformation and may contrib- 
ute to the pathobiology of pancreatic cancer. 
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Emerging high-throughput screening technologies 
are rapidly providing opportunities to identify new 
diagnostic and prognostic markers and new thera- 
peutic targets in human cancer. Currently, cDNA ar- 
rays allow the quantitative measurement of thou- 
sands of mRNA expression levels simultaneously. 
Validation of this tool in hospital settings can be done 
on large series of archival paraffin-embedded tumor 
samples using the new technique of tissue microar- 
ray. On a series of 55 clinically and pathologically 
homogeneous breast tumors, we compared for 15 
molecules with a proven or suspected role in breast 
cancer, the mRNA expression levels measured by 
cDNA array analysis with protein expression levels 
obtained using tumor tissue microarrays. The validity 
of cONA array and tissue microarray data were first 
verified by comparison with quantitative reverse tran- 
scriptase-polymerase chain reaction measurements 
and immunohistochemistry on full tissue sections, 
respectively. We found a good correlation between 
cDNA and tissue array analyses in one-third of the 15 
molecules, and no correlation in the remaining two- 
thirds. Furthermore, protein but not RNA levels may 
have prognostic value; this was the case for MUC1 
protein, which was studied further using a tissue mi- 
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had prognostic value. Thus, differences extended to 
clinical prognostic information obtained by the two 
methods underlining their complementarity and the 
need for a global molecular analysis of tumors at both 
the RNA and protein levels. (Am J Pathol 2002, 
161:1223-1233) 

The development of genomic, technological, and bioin- 
formatic tools have allowed progress in cancer research. 
DNA arrays are currently the most used of the new high- 
throughput methods to analyze the molecular complexity 
of tumors. Several studies have showed their potential in 
many types of human cancers. 1 " 4 Even if the clinical 
benefits for patients remain to be demonstrated, the first 
results are very encouraging. DNA arrays-based gene 
expression profiles are improving our understanding of 
the disease as well as tumor taxonomy by identifying new 
diagnostic or prognostic subclasses unrecognized by 
usual parameters. They are expected to lead to the dis- 
covery of new potential therapeutic targets, to accurate 
predictions of survival and response to a given treatment, 
and eventually to the delivery of a therapy appropriate to 
each individual patient. 

Once a potential marker is identified by this technique, 
an important next step is its validation and introduction in 
routine tests in hospital settings. 5,6 There, cDNA arrays 
are not the method of choice because they are still ex- 
pensive, time-consuming, complex, and require frozen 
material not always available. Validation studies have 
been done traditionally by immunohistochemistry (IHC) 
on paraffin-embedded tissues allowing analysis of many 
archived samples with a long follow-up. Until recently, 
pathologists examined sections of tumor slide by slide. 
Today, the recently developed tissue microarray (TMA) 
technology 7 " 9 allows the simultaneous analysis of 
thousands of tumor samples arrayed onto glass slides. 
This may facilitate the search for correlations between 



Supported by INSERM, institut Paoli-Calmettes, and grants from 
f'Association pour fa Recherche sur le Cancer and la Ligue Nationale 
contre le Cancer. 

ECJ and FB contributed equally to this work. 

Accepted for publication June 20, 2002. 

Address reprint requests to Daniel Birnbaum, U119 INSERM. Institut 



1 224 Ginestier et a) 

AJP October 2002, Vol. 161, No. 4 



molecular alterations and the histoclinical features of 
the tumors. 

In a recent cDNA array-based, prognosis-oriented 
study of 55 localized breast carcinoma samples, 10 we 
identified two clusters of discriminator genes (named I 
and II) the differential expression of which allowed to 
distinguish subclasses of tumors with significantly differ- 
ent clinical outcome after adjuvant chemotherapy. The 
aim of the present study was to validate some of these 
data using TMAs and to evaluate the interest and limita- 
tions of this technology as a validation tooL Cylinders 
from the same 55 tumors were arrayed in a specific 
tissue-microarray and studied by IHC using antibodies 
directed against proteins encoded by some of our dis- 
criminator genes. 

Materials and Methods 
Mammary Carcinoma Cell Lines 

Nine established mammary carcinoma cell lines were 
used as positive controls for expression of various genes 
or proteins. They included: BT-474, MCF-7, MCF-10F, 
MDA-MB-157, MDA-MB-175, MDA-MB-231, MDA-MB- 
453, BrCa-MZ-02, 11 and HBL-100. All cell lines are de- 
rived from carcinomas except HBL-100 and MCF-10F. 
They were obtained from the American Type Culture 
Collection, Rockville, MD {http://www.atcc.org/) and 
grown using the recommended culture conditions. 

Breast Tumor Samples and Characteristics of 
Patients 

Tumor samples were obtained from 55 women treated at 
the (nstitut Paoli-Calmettes. Inclusion criteria were: 1) 
localized breast cancer treated with adjuvant anthracy- 
clin-based chemotherapy in addition to loco-regional 
' treatment; 2) tumor material quickly macrodissected and 
frozen in liquid nitrogen and stored at -160°C; and 3) 
patient follow-up of 48 months or more after diagnosis. In 
addition to the axillary lymph node status, four poor prog- 
nosis criteria were used to determine whether adjuvant 
chemotherapy should be administered: patient age less 
than 40 years, pathological tumor size greater than 20 
mm, Scarff-Bloom-Richardson grade equal to 3, and neg- 
ative estrogen receptor (ER) status as evaluated by IHC 
with a positivity cutoff value of 1%. Women who received 
chemotherapy were those with either node-positive tu- 
mors or node-negative tumors and one of the poor prog- 
nosis criteria if nonmenopausal or two criteria if meno- 
pausal. All tumor sections were de novo reviewed by a 
pathologist (JJ) before analysis; all samples contained 
more than 50% tumor celts. Tumors were infiltrating ad- 
enocarcinomas including, according to the World Health 
Organization histological typing, 42 ductal, 5 lobular, 5 
mixed, and 3 medullary carcinomas. 

A second series of breast tumors was analyzed. It was 
constituted by 592 localized forms of breast cancer col- 



nitrogen (the 55 tumors previously described were in- 
cluded in this array). There were 401 ductal, 77 lobular, 
40 mixed, 4 medullary carcinomas, and 70 other histo- 
logical types. A total of 297 tumors were node positive 
and 450 were positive for ER. 

Extraction ofRNA from Frozen Tissue 

Total RNA was extracted from tumor samples by stan- 
dard methods, as previously described. 12 RNA integrity 
was controlled on denaturing formaldehyde-agarose gel 
electrophoresis and Northern blots using a 28S-specific 
oligonucleotide. 

DNA Arrays 

DNA arrays were made in our facility (Technologies 
Avance6s pour le Genome et la Ciinique)). Nylon filter 
preparation with spotted polymerase chain reaction 
(PCR) products derived from -1000 selected candidate 
cancer genes, 33 P radioactive hybridization, and data 
acquisition, normalization, and analysis have been de- 
scribed elsewhere 13,14 and can also be consulted on our 
web site (http:/tagc.univ-mrs.fr/pub/Cancer/). 

Reverse Transcription 

RNA extracted from frozen tissue was reverse-tran- 
scribed in a final volume of 20 /ml containing 1 x reverse 
transcriptase (RT)-PCR buffer (Invitrogen Corp., Carts- 
bad, CA) , 5 mmol/L MgCI 2 (Invitrogen), 1 mmoi/L dXTP 
(Roche Diagnostics, Meylan, France), 10 mmol/L dithio- 
threitol (Invitrogen), 5 ^mol/L random hexamers (Roche), 
20 U of RNase inhibitor (Promega Biosciences, Madison, 
Wt) , 200 U of superscript reverse transcriptase (Invitro- 
gen), and 1 /xg of total RNA (calibration curve points and 
patient samples). Samples were incubated at 20°C for 10 
minutes and 42°C for 45 minutes; reverse transcriptase 
was inactivated by heating at 99°C for 3 minutes and 
cooling at 4°C for 5 minutes. 

Real-Time Quantitative RT-PCR (RQ-PCR) 

RQ-PCR analyses for ERBB2, MUC1 , and 7BP (TATA box 
binding protein) mRNA were done using the ABI PRISM 
7700 Sequence Detection System instrument and soft- 
ware (Perkin Elmer Applied Biosystems, Foster City, CA). 
Conditions for the analysis of these markers have been 
described. 15 * 16 Primers and probes for the TaqMan sys- 
tem were designed to meet specific criteria by using 
Primer Express software (Perkin Elmer) and were synthe- 
sized by Genset (Genset Olijos, La Jolla, CA, USA) for the 
primers and by Roche for the probes. The 5'- and 3'-end 
nucleotides of the probe were labeled with a reporter (FAM, 
6-carboxy-f luorescein) and a quencher dye (TAMRA, 6-car- 
boxy-tetramethylrhodamine). The sequences of the PCR 
primer pairs and fluorogenic probes used for each gene 
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Table 1. Sequences of Oligonucleotide Primers and Probes Used in RQ-PCR Experiments 



Gene 



ERBB2 

MUC1 

TBP 



Oligonucleotide 



Forward primer 
Reverse primer 
Probe 

Forward primer 
Reverse primer 
Probe 

Forward primer 
Reverse primer 
Probe 



Sequence 



PCR product size 



5'-AGCCGCGAGCACCCAAGT-3' (exon 1) 

5'-TTGGTGGGCAGGTAGGTGAGTT-3' (exon 2) 

5'-CCTGCCAGTCCCGAGACCCACCT-3' 

5 ' - ACCATCCTATG AGCG AGTACC-3' (exon 6) 

5'-GTTTCTGCAGGTAATGGTGGC-3' (exon 7) 

5'-CCCATGGGCGCTATGTGCCC-3' 

5'-CACGAACCACGGCACTGATT-3' 

5'-TTTTCTTGCTGCCAGTCTGGAC-3' 

5'-TGTGCACAGGAGCCAAGAGTGAAG-3' 



147 bp 
107 bp 
89 bp 



Bank accession no. M 11 730, MUC1 GenBank accession 
no. J05581, TBP GenBank accession no. X54993. The 
precise amount of total RNA added to each reaction mix 
(based on absorbance) and its quality (ie, lack of exten- 
sive degradation) are both generally difficult to assess. 
Therefore, the relative expression level of the gene of 
interest was computed with respect to the internal stan- 
dard TBP to normalize for variations in the quality of RNA 
and the amount of input cDNA. Ct (threshold cycle) was 
used for quantification of the input target number and all 
experiments were done with duplicates for each data 
point. All patient samples with a variation >1 Ct for the 
duplicate were retested. For each experimental sample, 
the amount of target and endogenous reference was 
determined from a standard curve. The standard curve 
was constructed with fivefold serial dilutions of cDNA 
(1000 ng to 1 ng) from BT-474 (for ERBB2) and MCF-7 
(for MUC1) breast carcinoma cell lines, respectively. The 
relative target gene expression in a tested sample was 
normalized using a calibrator sample, ie, the HME1 hu- 
man primary mammary epithelial cell line (Clontech). The 
level of expression of the target gene was given by the 
N-ratio, in which each normalized gene value (ERBB2, 
MUC1) was divided by a calibrator normalized gene 
value (TBP). 



ERBB2. 



SAMPLE 



^ERBBS " 



TBP, 



SAMPLE 



ERBB2, 



CALIBRATOR 



TBPr 



_ MUC1 SAMPLE / MUC 1 CALIBRATOR 



TBPq 



TBP r . 



PCR was done with 1 x TaqMan Universal PCR Master 
Mix (Perkin Elmer), 300 nmol/L of primers, 200 nmol/L of 
the probe, and 1 jul of each appropriately diluted reverse 
transcription sample in a 25-/il final reaction mixture. 
After a 2-minute incubation at 50°C to allow for uracyl 
N-glycosytate cleavage, AmpliTaq Gold was activated by 
an incubation for 10 minutes at 95°C. Each of the 40 PCR 
cycles consisted of 15 seconds of denaturation at 95°C 
and hybridization of probe and primers for 1 minute at 
60°C. 

TMA Construction 

TMAs were prepared as described 9 with slight modifica- 
tions. For each tumor, three representative tumor areas 
were carefully selected from a hematoxylin- and eosin- 
stained section of a donor block. Core cylinders with a 
diameter of 0.6 mm each were punched from each of 
these areas and deposited into a recipient paraffin block 



Table 2. List of Proteins Tested by Immunohistochemistry and Characteristics of the Corresponding Antibodies 



Protein 



Antibody 



Angiogenin (ANG) 


Rabbit polyclonal 


BCL2 


mmab 


ECadherin (CDH1) 


mmab 


ERBB2 


mmab 


ERBB2 


mmab 


ERBB2 


Rabbit polyclonal 


Estrogen receptor (ESR1/ER) 


mmab 


FGFR1 


Rabbit polyclonal 


GATA3 


mmab 


Ki67 


mmab 


Melan A/MART 1 (MLANA) 


mmab 


MUC1 


mmab 


P53 


mmab 


Progesterone receptor (PR) 


mmab 


Prolactin receptor (PRLR) 


mmab 


Transforming acidic coiled-coil 1 TACC1 


Rabbit polyclonal 


Transforming acidic coiled-coil 2 TACC2 


Rabbit polyclonal 


Thrombospondin 1 (THBS1) 


mmab 



Origin 


Clone 


Dilution 


Santa Cruz Biotechnology 


sc-9044 


1/20 


DAKO 


124 


1/100 


Transduction Laboratories 


36 


1/2000 


Novocastra Laboratories Ltd. 


CB 11 


1/500 


Oncogene Research 


3B5 


1/500 


Products 






DAKO 


AO 485 


1/1000 


Novocastra Laboratories Ltd. 


6F11 


1/60 


Santa Cruz Biotechnology 


sc-121 


1/200 


Santa Cruz Biotechnology 


sc-268 


1/100 


DAKO 


KI-67 


1/100 


DAKO 


A103 


1/2 


Transgen 


H23 


1/1000 


Immunotech 


DO-1 


1/4 


DAKO 


PgR 636 


1/80 


NeoMarkers 


B6.2 


1/200 


Upstate Biotechnology 


07-229 


1/200 


Upstate Biotechnology 


07-228 


1/40 


Oncogene Research 


46.4 


1/10 


Prnrtt icts 
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using a specific arraying device (Beecher instruments, 
Stiver Spring, MD). In addition to tumor tissues, the re- 
cipient block also received normal breast tissue and cell 
line pellets. Five-^m sections of the resulting microarray 
block were made and used for IHC analysis after transfer 
to glass slides. Two TMAs were prepared; the first one 
contained the 55 tumors studied by cDNA arrays (with 
three cores per sample) and controls, the second one 
was used for MUC1 study and contained 592 tumor 
samples (with one core per sample) and controls. 



Antibodies and IHC 

The characteristics of the antibodies used are listed in 
Table 2. IHC was performed on 5-ftm sections of forma- 
lin-embedded tissue specimens. They were deparaf- 
finized in histolemon (Carlo Erba Reagenti, Rodano T Italy) 
and rehydrated in graded alcohol. Antigen enhancement 
was done by incubating the sections in target retrieval 
solution (DAKO, Copenhagen, Denmark) as recom- 
mended except for prolactin receptor, in which pretreat- 
ment was done with incubation in pepsin (Zymed Labo- 
ratories, South San Francisco, CA), for 30 minutes at 
37°C, and for MUC1 , in which no pretreatment was done. 
Slides were then transferred to a DAKO autostainer. 
Staining was done at room temperature as follows: after 
washes in phosphate buffer, followed by quenching of 
endogenous peroxidase activity by treatment with 0.1% 
H 2 0 2 , slides were first incubated with blocking serum 
(DAKO) for 10 minutes and then with the affinity-purified 
antibody for 1 hour. After washes, slides were incubated 
with biotinyiated antibody against rabbit Ig for 20 minutes 
followed by streptavidin-conjugated peroxidase (DAKO 
LSAB R 2 kit). Diaminobenzidine or 3-amino-9-ethylcarba- 
zole was used as the chromogen, counterstained with 
hematoxylin, and coverslipped using Aquatex (Merck, 
Darmstadt, Germany) mounting solution. Slides were 
evaluated under a light microscope by two pathologists 
(EC-J, JJ). 

Immunoreactivities were classified by estimating the per- 
centage (P) of tumor cells showing characteristic staining 
(from undetectable level or 0%, to homogeneous staining or 
100%) and by estimating the intensity (I) of staining (1 , weak 
staining; 2, moderate staining; or 3, strong staining. The 
cutoff values were the same for all markers tested. Results 
were scored by multiplying the percentage of positive cells 
by the intensity, ie, by the so-called quick score (Q) (Q = 
P x I; maximum = 300). For Ki67, only the percentage (P) 
of tumor cells was estimated, because intensity does not 
vary. Expression levels allowed to group tumors into four 
categories: negative expression (Q = 0 or P = 0 for Ki67), 
weak expression (0 < Q < 120 or 0 < P < 25 for Ki67), 
moderate expression (120 < Q s= 210 or 25 < P < 60 for 
Ki67) and strong expression (210 < Q < 300 or 60 < P < 
100 for Ki67). Because of its prognostic impact the topo- 
graphical localization of MUC1 was taken into account and 
expressed in four categories: absence, apical, circumfer- 
ential membrane, and cytoplasmic 



IHC on Full Tissue Sections 

To validate the use of TMAs for immunophenotyping, we 
compared the protein expression levels of ER, proges- 
terone receptor, P53, and BCL2, on full tissue sections 
and on TMAs for the group of 55 tumors. The data on full 
sections were compared to the mean of intensities of the 
three 0.6-mm core biopsies for 47 cases, or of only two 
core biopsies for 8 cases. 

Statistical Analysis 

The concordance between RNA expression levels mea- 
sured by real-time quantitative RT-PCR and cDNA arrays 
was examined using Spearman's rank correlation. Com- 
parison between IHC data from full sections and TMAs 
analyses was measured using k statistics (a k value >0.7 
indicated a strong association). Contingency table anal- 
ysis was used to analyze the relationship between protein 
expression obtained by IHC on TMAs and RNA expres- 
sion obtained with cDNA arrays (total chi-square test). 
Survival analysis used the Kaplan-Meier method and sur- 
vival curves were compared using the tog-rank test (a P 
value <0.5 was considered as significant). All P values 
were two-sided. To assess the relationship between two 
variables assumed to be related (ie, the co-regulated 
molecules), simple linear regression analyses were per- 
formed using Excel Software (Microsoft). For these tests, 
(O.O) points were removed; the relationship tested was 
thus for cases with at least one positive value. Each result 
is given with: N the sample size, a the slope of the 
regression line, the P value and r2, the coefficient of 
determination. Thus, for each positive comparison a lin- 
ear relationship can be determined (eg, y = 0.8x + 20 
means BCL2 = 0.8ER + 20). 



Results 

Selection of Molecules 

We previously analyzed the mRNA expression profiles of 
-1000 selected genes in 55 breast carcinoma samples 
using home-made cDNA arrays. Tumors were homoge- 
neous with respect to histological and clinical parame- 
ters, and all patients had received adjuvant anthracyclin- 
based chemotherapy. Detailed results are described 
elsewhere. 10 Briefly, molecular profiling combined with 
hierarchical clustering allowed the identification, among 
this set of poor-prognosis localized breast cancers, of 
new subclasses distinct with respect to overall and me- 
tastasis-free survivals. Such a classification resulted from 
the differential expression of two discriminator gene clus- 
ters (named I and II) and was not possible using classical 
prognostic factors of disease. Cluster I included the ESR1 
gene encoding ER-a. For the present study, we selected 
10 of these genes. Interestingly, six of them (BCL2, 
ERBB2, ESR1, GATA3, MUC1 , PRLR) have also been 
frequently identified as discriminator genes in expres- 
sion-profiling studies of breast cancer that have ad- 
dressed the prognosis issue. 4 - 17 " 19 These genes were 
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Figure 1. Expression levels of ERBB2 and MUC1 mRNA levels measured by 
cDNA array analysis and real-time quantitative PCR amplification. ERBB2and 
MUC1 mRNA expression levels measured using cDNA arrays (artificially X30 
for visual effect) (left) and real-time quantitative PCR amplification (artifi- 
cially X30 for visual effect) (right). Results for each tumor (from top to 
bottom) are represented as opposite bars. For ERBB2. p s = 0.78, P < 
0.0001; for MUC1. p s = 0.88, P < 0.0001. 



thus interesting candidates for further investigation, tn 
addition, other molecules, such as CDH1 t Ki67, TP53, 
progesterone receptor, TACC1, 20 and TACC2, were re- 
tained because of a known or suspected role in breast 
cancer. The selection criteria for all molecules also in- 
cluded availability of a commercial antibody. The com- 
plete list of the corresponding proteins tested in the fol- 
lowing experiments is given in Table 2. 

Validation ofcDNA Array Data with RQ-PCR 

Our cDNA array analyses regularly included extensive 
experiments and controls designed to ensure reproduc- 
ibility and reliability of expression measurements. 1,13 - 14 - 21 
Nevertheless, we sought to further validate our data by 
comparing RNA expression levels of two genes, ERBB2 
and MUC1 , as measured by cDNA array, to those ob- 
tained by RQ-PCR. 

RNA from 50 of 55 samples (RNA was no longer avail- 
able for five cases) was reverse-transcribed and PCR 
amplification of ERBB2 and MUC1 cDNA was done using 
a TaqMan device. For ERBB2, 41 tumors displayed 
mRNA expression levels comparable to normal breast 
and HME1 control cell line, whereas nine samples (18%) 
showed overexpression. For MUC1 , 17 tumors displayed 



mRNA expression levels comparable to normal breast 
and HME1 control cell line, whereas 33 samples (66%) 
showed overexpression. As shown in Figure 1, mRNA 
expression levels obtained with both methods were 
highly similar (Spearman test: ERBB2, p s = 0.78, P < 
0.0001; MUC1, Ps - 0.88, P < 0.0001), further suggest- 
ing reliability of our cDNA array data. 



TMA Analysis and Validation of Data 

To validate our TMA analyses, we compared the expres- 
sion of four selected proteins (BCL2, ER, P53, progester- 
one receptor) measured by IHC using either standard full 
tissue sections or TMAs in the panel of 55 breast tumors. 
For BCL2 expression, 38 cases (69%) showed positive 
cytoplasm staining, whereas 17 cases (31%) were neg- 
ative on analysis of full sections. In comparison, 37 cases 
(67%) were positive and 18 cases were negative (33%) 
on TMA. Overall, the concordance was 91% and the 
nonconcordance was 9% (five cases), resulting in a 
strong statistical association between the two methods (k 
value, 0.78). An even better correlation was found for 
nuclear expressions of ER, P53, and progesterone recep- 
tor, with only 3 discordant cases of 55 for each of them 
(concordance, 95%; Kappa values, 0.86 to 0.88). This 
high degree of concordance between IHC on full sec- 
tions and on TMAs justified further use of TMAs. 



Analysis of Breast Tumors Using TMAs 

Fifteen proteins, including the four previously cited, were 
tested by IHC on TMAs. Most of them corresponded to 
genes we had identified in our two discriminator gene 
clusters I and IL 10 Other tested molecules corresponded 
to proteins of interest in breast cancer. Immunostainings 
were evaluated by the quick score (except for Ki67). 
Results are shown in Table 3 and Figure 2. 



Table 3. Results of IHC Statnings on Tissue Microarrays 



Protein Location of staining Normal Negative Weak Moderate Strong 



cDNA array, cluster I gene-encoded 


ANG 


Cytoplasm + Stroma 


(+) 


17 


19 


5 


14 


BCL2 


Cytoplasm 


(+) 


23 


10 


17 


5 




ESR1/ER 


Nucleus 


(+) 


22 


14 


5 


14 




GATA3 


Nucleus 


(+) 


26 


12 


7 


10 




MUC1 


Cytoplasm 


(+) 


3 


19 


8 


25 




THBS1 


Cytoplasm + Stroma 


(+) 


16 


30 


9 


0 


cDNA array, cluster II gene-encoded 


MLANA 


Cytoplasm 


(+) 


22 


20 


6 


7 


PRLR 


Membrane 


(+) 


21 


12 


7 


15 


Others 


CDH1 


Membrane 


(+) 


6 


10 


14 


25 


ERBB2 (CB 11) 


Membrane 


(-) 


34 


14 


4 


3 




ERBB2 (A0485) 


Membrane 


(-) 


30 


11 


9 


5 




ERBB2 (3B5) 


Membrane 


(-) 


37 


8 


2 


8 




FGFR1 


Membrane + Cytoplasm 


<+) 


20 


20 


12 


3 




Ki67 


Nucleus 


(+) 


4 


27 


13 


11 




P53 


Nucleus 


(-) 


33 


12 


0 


10 




TACC1 


Cytoplasm 


(+) 


21 


22 


9 


3 




TACC2 


Cytoplasm 


(+) 


5 


18 


13 


19 



(+) and (-) mean expressed or not in normal breast tissue, respectively. 
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Figure 2. Expression of proteins studied by IHC on TMAs. A: H&E staining 
of a paraffin block section (25 X 30 mm) from the TMA containing 216 
arrayed tumor (3 X 55) and control samples. B: Anti-angiogenin staining. O 
Anti-FGFRl staining. D: Anti-GATA3 staining. E: Anti-PRLR staining. From B 
to E, the first section is from normal breast tissue, the second and third from 
tumor tissue (the second illustrates a moderate staining whereas the third 
illustrates a strong staining). Original magnifications, X50. 



Comparison of the Results Obtained by cDNA 
Arrays and TMAs 

Expression levels obtained by IHC on TMA and by cDNA 
array hybridizations were compared for the 15 molecules. 
Data from TMA analyses are discontinuous, whereas 
those obtained by cDNA array analyses are continuous. 
To facilitate comparisons, we transformed the cDNA ar- 
ray values into discontinuous data. Tumors were then 
grouped into two or three classes for each method (Table 
4). Homogeneous classes were defined for TMA, by 
grouping tumors with an equivalent staining level (see 
Table 3). For cDNA arrays, classes were visually defined 
on examination of the distribution graphs (Figure 3). 



Each tumor sample was then placed into one of the 
three TMA classes and attributed 1 , 2, or 3, and into one 
of the three cDNA array classes and attributed 1 , 2, or 3. 
Table 4 shows the number of samples in each class. 
Concordance between the two scores was evaluated by 
a contingency table analysis. A strong concordance was 
seen for 5 of the 15 comparisons with similar expression 
levels measured by the two methods: EFt, ERBB2, and 
GATA3 (P < 0.001), BCL2 (P < 0.02), and TACC1 (P < 
0.05). No concordance was seen for ANG, CDH1, 
FGFR1, Ki67, MLANA, MUC1, P53, PRLR, TACC2, and 
THBS1. Figure 4 shows example of comparative graphs. 

Groups of Co-Regulated Molecules 

Using cDNA arrays and hierarchical clustering, we had 
evidenced a co-expression of ESR1 (encoding ER-a), 
6CL2, and GATA3 at the rnRNA level in breast tumors, 1,10 
with a statistically significant correlation between ESR1 
and GATA3 (r = 0.73, P 2 = 0.53, P < 0.0001). As shown 
in Figure 5A, the correlation between the three molecules 
was statistically confirmed at the protein level as mea- 
sured by IHC on TMA. FGFR1, TACC1, and TACC2 pro- 
tein levels also varied together but the correlation was 
weaker (Figure 5B). For each pairwise comparison, with 
the same number of samples (n — 55), we calculated a 
coefficient of correlation and a P value: BCL2/ER, r = 
0.79, R 2 = 0.62, P < 0.0001; GATA3/ER, r = 0.74, P 2 - 
0.54, P < 0.0001; TACC1/FGFR1, r = 0.67, R 2 = 0.45, 
P < 0.001; and TACC2/FGFR1, r = 0.57, P 2 - 0.32, P < 
0.001. 

Impact on Survival of RNA and Protein 
Expression Levels 

To further estimate the clinical interest of the cDNA array 
and TMA combined approach, we examined and com- 
pared the prognostic information provided by rnRNA and 
protein expression levels for each of the 15 molecules 
independently. Only 2 of the 15 tested markers showed 
individual prognostic value. High THBS1 rnRNA levels 
were associated with a better survival whereas no such 
correlation was found with protein levels. The opposite 
was true for MUC1: low levels of MUC1 protein were 
associated with a better survival, whereas rnRNA levels 
did not correlate with survival (Figure 6). Thus, depend- 
ing on the marker, clinically relevant information was 
differently provided by cDNA or TMA technique, suggest- 
ing that both analyses are worth performing simulta- 
neously on the same cases. 

These results were obtained on a limited number of 
cases representing a selected population of poor prog- 
nosis localized tumors. We sought to confirm the obser- 
vation on MUC1 on a larger series of cases (Figure 7A). 
We studied 592 samples (including the 55) arrayed in a 
second TMA with anti-MUC1 antibody. MUC1 staining in 
normal cells is either absent or detected in the apical 
membrane; tumor cells express MUC1 in two abnormal 
localizations (cytoplasm or circumferential membrane) 
and a strong cytoplasmic staining is associated with a 
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Table 4. Comparison of Expression Levels Measured Using Analyses of Tissue Microarrays and cDNA Arrays 



Gene 




Tissue microarray classes 




cDNA array classes 


Concordance 


1 


2 


3 


1 


2 


3 


P values 


ESR1/ER 


22 (N) 


19 (W +M) 


14 (S) 


15 


22 


18 


<0.001 


BCL2 


23 (N) 


10 (W) 


22 (M+S) 


18 


37 


0 


<0.02 


P53 


33 (N) 


22 (W + M+S) 


/ 


46 


9 


0 


NS 


GATA3 


26 (N) 


12(W) 


17 (M+S) 


18 


25 


12 


<0.001 


PRLR 


21 (N) 


19 (W +M) 


15 (S) 


36 


12 


7 


NS 


ERBB2 (3B5) 


37 (N) 


10 (W +M) 


8(S) 


46 


4 


5 


<0.001 


CDH1 


16 (N +W) 


14 (M) 


25 (S) 


42 


13 


0 


NS 


TACC2 


23 (N +W) 


13 (M) 


19 (S) 


19 


18 


18 


NS 


TA^P A 

1 AUO 1 




00 (\N\ 




19 


18 


18 


<0.05 


MLANA 


22 (N) 


20 (W) 


13 (M+S) 


34 


21 


0 


NS 


FGFR1 


20 (N) 


20 (W) 


15 (M+S) 


45 


10 


0 


NS 


ANG 


17 (N) 


19 (W) 


19 (M+S) 


17 


30 


8 


NS 


THBS1 


16 (N) 


30 (W) 


9 (M+S) 


46 


6 


8 


NS 


Ki67 


31 (N +W) 


24 (M + S) 


/ 


11 


33 


11 


NS 


MUC1 


22 (N +W) 


33 (M + S) 


/ 


39 


8 


8 


NS 



N, Negative; W, weak; M, moderate; S. strong; NS, not significant. 

Numbers tor tissue microarrays are taken from Table 3 and numbers for cDNA arrays are obtained using the method shown in Figure 3. 



poor prognosis. 22 For the 55 tumors of the first TMA, the 
prognostic value of the quantitative quick score was re- 
lated to a high frequency of abnormal cytoplasmic and 
circumferential MUC1 localizations (83%) as compared 
to apical localization and absence (17%). Of the 592 
cases of the second TMA, 551 were available for analysis 
after MUC1 staining: 249 cases (45%) showed apical or 
no staining, 302 (55%) displayed cytoplasmic or circum- 
ferential membrane staining (Figure 7B). In this larger 
series the quantitative quick score did not have a signif- 
icant prognostic value. This was because of the fact that 
the topographical aspect was significantly different from 
that of the short series with only 55% versus 83% of 
cytoplasmic and circumferential localizations. When con- 
sidered, qualitative assessment of the staining provided 
prognostic information; the apical localization and the 
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Figure 3. Transformation of continuous cDNA array data into discontinuous 
data. mRNA expression levels measured by cDNA array are plotted for each 
sample in an increasing order. For each gene, classes are determined on 
visual inspection and are separated by vertical bars on the graphs. Results 
for ER-or (£S/?J), prolactin receptor (PRIM), mucin 1 (MVC1), and ERBB2 are 
shown. 



absence of MUC1 strongly correlated with a better evo- 
lution (P = 0.0154) (Figure 7C). 



Discussion 

The recent availability of new high-throughput molecular 
analyses offers the opportunity to tackle the complexity 
and the combinatorial nature of breast cancer at the 
molecular level. Expected applications are a better un- 
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Figure 4. Comparison of data obtained by cDNA array and IHC on TMA. 
Results for each tumor (from top to bottom) are represented as opposite 
bars, with the value of JHC (quick score) on the left, and the value of the 
cDNA array analyses (artificially X30 for visual effect) on the right. Values 
for ER, GATA3, and ERBB2 show good correlation between the two methods, 
whereas values for P53, THBS1, and MUCI do not show such correlation. 
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Figure 5. Similar variations in expression levels of two groups of proteins. A: The expression levels of ER, BCL2, and GATA3 as measured by IHC on TMAs 
correlated, as determined by simple linear regression analysis. B: Similarly, the expression levels of FGFR1, TACC1, and TACC2 correlated. 



derstanding of the disease and the identification of new 
diagnostic and prognostic markers and therapeutic tar- 
gets, both needed to improve the management of pa- 
tients. At the same time it introduces a new challenge for 
pathologists who, in charge of the first assessment of the 
tumors, need to know how to optimally use these new 
methods. The present study directly followed a cDNA 
array-based analysis of a breast tumor series. The tumor 
samples were obtained from 55 women with poor prog- 
nosis breast cancer treated with adjuvant chemotherapy. 
Currently such patients have a long-term survival of 
-70% and there is a crucial need to identify parameters 
that might accurately predict the clinical outcome in in- 
dividual patients. Our study was designed to evaluate the 
interest and limitations of IHC on TMA as a natural exten- 
sion of the cDNA array approach in a hospital setting. 

We first confronted cDNA array and TMA analyses to 
other methods, ie, RQ-PCR and conventional IHC, re- 
spectively. The good concordance between mRNA ex- 
pression levels observed by cDNA arrays and RQ-PCR 
further confirmed the validity of our cDNA array measure- 
ments. TMAs allow to screen large series of tumor sam- 
ples using several archival materials, but their represen- 
tation of the entire tumor has been questioned. Our 
degrees of concordance between stainings on full sec- 




Figure 6. Kaplan-Meier plots of patient overall survival. Left: Survival ac- 
cording to MUCJ mRNA and protein expression levels. Right* Survival 
according to THBS1 mRNA and protein expression levels (labeled high and 
low). High and low protein levels correspond to strong plus moderate versus 
weak plus negative (see Table 3), respectively, and high and low mRNA 
levels correspond to classes 2 and 3 versus class 1 (see Figure 4), respectively. 



tions and on TMA were in the same range as published 
studies. Several authors have reported that TMA con- 
structed with three cores per sample (as in our study) are 
representative of whole tumor specimens 23-30 

As a large-scale validation tool of DNA or RNA data, 
IHC on TMAs should be interpreted with caution. Indeed, 
comparison of our cDNA array and TMA data, obtained 
on the same breast tumor samples, gave different results 
according to the gene product examined. 

For a category of molecules we found important differ- 
ences between RNA and protein expression levels. This 
was the case of P53. This discrepancy was rather ex- 
pected because P53 protein detection is not dependent 
on mRNA overexpression, but is because of the in- 
creased half-life of a mutated protein. In normal cells, P53 
protein half-life is short and expression levels are low and 
undetectable by IHC. In cancer cells, most P53 mutations 
lead to products that are not ubiquitinated and accumu- 
late in the nuclei where they can then be detected. Other 
noteworthy cases were MUC1 and THBS1. These differ- 
ences certainly stem from the fact that different levels of 
biological information are examined. For many genes, 
there is little correlation between the abundance of the 
mRNA transcript with steady-state levels of the encoded 
protein. Posttranscriptional and posttranslational mecha- 
nisms are likely to influence protein expression, thus blur- 
ring the correlation between mRNA and protein levels. Pro- 
teins encoded by very low levels of RNA, ie, below the 
detection level of cDNA arrays, can be detected by IHC 
because of increased protein stability (eg, the case of P53) 
or high sensitivity of the antibody, and reciprocally, elevated 
levels of RNA may produce only little amounts of detectable 
proteins. Special calibration of the antibody aimed to detect 
only a certain level of protein is another limitation. The cho- 
sen antibody may also detect only certain forms of a protein 
that do not correspond to the cDNA sported on the DNA 
array, because of alternative splicings of mRNA for exam- 
ple. This particularly can explain the difference observed 
between THBS1 mRNA and protein levels, and conse- 
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Figure 1, Expression of MUC1 protein studied by IHC on a t issue- microarray. 
A: H&E staining of a paraffin block section (25 X 30 mm) from the TMA 
containing 647 arrayed samples, including 592 tumors and 55 controls. B: 
MUC1 staining: normal breast tissue (left), apical (middle), and cytoplasmic 
(right) staining in tumors. O Kaplan-Meier plot of patient overall survival: 
survival differs significantly according to MUC1 protein localization. A: Ab- 
sence of staining or apical localization; B: cytoplasmic or circumferential 
membrane localization. 



quently. their different prognostic impact 31 Finally, distinct 
areas of a heterogeneous tumor may be submitted to RNA 
and protein analyses. 

Conversely, we observed an excellent correlation be- 
tween RNA and protein levels in one-third of the tested 
molecules. This was the case for ERBB2, despite the fact 
that its corresponding antibody is calibrated to detect 
only overexpression. Among the other molecules with 
correlated mRNA and protein expression levels were ER, 
GATA3, and BCL2. We and others had shown that the 
mRNA levels of the three genes covaried in cDNA array 
analyses. 1,10,32 Here we were able to confirm this co- 
expression at the protein level. This group of co-regu- 
lated genes and proteins may be linked to the hormonal 
control of the mammary gland. Such identification is im- 
portant for a better understanding of gene and protein 
networks that operate in cancer cells; it may lead to the 
discovery of new molecules to be targeted to block or 
stimulate a metabolic pathway or function; it may also 
provide a prognostic information clinically more relevant 
than that of isolated markers because it better reflects the 
functional status of a pathway such as the estrogen path- 
way of breast tumors. 



Several studies have shown the interest of TMA studies 
in cancer research to extend cDNA array data 33 A pio- 
neering analysis was conducted by Moch and col- 
leagues; 8 after the identification of vimentin as overex- 
pressed in a renal cancer cell line using cDNA arrays, the 
authors extended this result to the protein level on a 
series of 532 tumor specimens arrayed onto a renal can- 
cer TMA. Using TMA of bladder tumors containing 2317 
specimens from 1842 patients, Richter and colleagues 9 
found a positive correlation between CCNE gene ampli- 
fication measured by fluorescence in situ hybridization 
and cyclin-E protein overexpression measured by IHC. 
The combination of cDNA array and TMA allowed the 
identification of IGFBP2 and HSP27, 34 hepsin 2 and AM- 
ACR 30 as significantly overexpressed in prostate cancer, 
suggesting their putative diagnostic interest. IGFBP2 was 
also found as a marker of poor prognosis in a series of 
418 brain tumors arrayed onto a TMA 35 A similar study 
showed the overexpression of the WT protein in ovarian 
cancer. 36 The expression level of PKC/3 was measured 
by IHC on a B-cell lymphoma TMA to validate cDNA array 
data 3 In breast carcinomas, Hedenfalk and colleagues 37 
showed that, tike mRNA levels, cyclin D1 protein levels 
were differentially distributed among BRCA1 and BRCA2 
hereditary tumors. All these studies showed a good cor- 
relation between the two techniques of investigation, but 
were limited to the analysis of a single highly selected 
marker and were not, with few exceptions, conducted on 
the same samples. Our present study is the first deliber- 
ate comparative analysis of cDNA and TMAs. It shows a 
correlation between the two techniques for one-third of 
the selected markers and the absence of correlation for 
the other two-thirds. 

These discrepancies deserve two commentaries. First, 
given the flurry of encouraging data associated with the 
rapidly emerging cDNA array technology, it is paramount 
to determine to what extent changes in mRNA expression 
are accompanied or not by similar changes at the protein 
level. In some cases, the differences may be eliminated 
by a number of experimental precautions, such as selec- 
tion of biopsy cores and antibodies, but in other cases, 
they will remain. If protein levels of a target molecule, or 
a group of molecules, correlate with its selection by 
cDNA array, IHC on TMA offers a powerful tool to quickly 
evaluate the clinical relevance of differentially expressed 
genes. But if they do not correlate, the cDNA array and 
TMA results must be considered independently because 
each can provide distinct information. 

Second, even if the intrinsic prognostic power of cDNA 
array data and clustering analyses derives from the com- 
bined expression of several genes, and not from an indi- 
vidual gene, it may be interesting for routine clinical ap- 
plication to test each of these genes as a candidate 
marker and to determine how its expression may alone 
distinguish the tumor classes. The main interest of TMA 
lies in the possibility to test large series of tumor samples 
with individual markers. In our series of samples, we 
observed that mRNA, but not protein expression levels of 
THBS1 had prognostic value, suggesting that they play 
an important role in the discriminator power of the cDNA 
array gene cluster. In contrast, for MUC1, as seen earli- 
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er, 38 low levels of protein were associated with a better 
prognosis, which was not the case for mRNA; IHC further 
allowed a qualitative appreciation of the protein localiza- 
tion, which happened to be crucial information for prog- 
nosis when an unselected population was studied. 

In the period of validation studies that has now begun, 
for which retrospective IHC studies on archival paraffin- 
embedded material are required, 6 it is particularly impor- 
tant to bear in mind that differences between mRNA and 
protein expression levels are possible with respect to 
intensities and to prognostic relevance. These differ- 
ences underline the complementarity or synergy between 
expression measurements from cDNA arrays and IHC on 
TMA, and also the need for other high-throughput tech- 
nologies such as cDNA arrays containing alternatively 
spliced transcripts, 39 protein arrays, 40 and in situ hybrid- 
izations on TMAs. 41 The combination of these comple- 
mentary approaches will accelerate even more the iden- 
tification of new diagnostic and prognostic markers as 
well as new therapeutic targets and will improve the 
management of breast cancer patients. 
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pl85 overexpression in 220 samples of breast cancer undergoing 
primary surgery: comparison with c-erbB-2 gene amplification. 

Dalifard I, Daver A, Goussard J, Lorimier G, Gosse-Brun S, Lortholary 
A, Larra F. 

Laboratoire de Radioanalyse, Centre Paul Papin, 49033 Angers Cedex 01, 
France. 

In breast cancer, DNA amplification of the oncogene c-erbB-2, encoding for 
the pi 85 protein, is associated with a poor prognosis. A retrospective study o 
a population of 220 cases of primary breast cancer permitted a quantitative 
measure of pi 85 oncoprotein overexpression by an immunoenzymetric assay 
and the determination of c-erbB-2 amplification by the Southern blot method 
A correlation existed between the two measurements (r^O.85) using the 
double cut-off: DNA 2 copies and pi 85 400 U/mg protein, and only 2.7% of 
the cases were discordant. 13.2% of the tumors showed pi 85 overexpression 
The percentage of tumors overexpressing pi 85 was significantly different 
between the groups with amplified and non-amplified c-erbB-2. We observe< 
a significant correlation between pi 85 levels and tumor grade (p=0.03), and 
an inverse correlation with hormonal receptors (p=0.0001). The pi 85 assay 
could be an additional prognostic factor to better define patient subgroups 
with node negative, grade II, and positive or negative hormonal receptors. 

PMID: 9852307 [PubMed - indexed for MEDLINE] 
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f anscribed gene, containing a variable number of tandem repeats, 
fes for a human epithelial tumor antigen 

^A cloning, expression of the transfected gene and over-expression in breast cancer tissue 

^'HAREUVENI 1 ' 2 y Han TSARFATY 1 , Joseph ZARETSKY 1 , Phillip KOTKES 1 , Judith HOREV 1 , Sheila ZRIHAN \ 
-^dechai WEISS 1 , Stephen GREEN 2 , Richard LATHE 2 , Iafa KEYDAR 1 and Daniel H. WRESCHNER 1 

apartment of Microbiology, Faculty of Life Sciences, Tel Aviv University, Israel 
r -boratoire de Genetique Moleculaire des Eucaryotes du Center National Recherche Scientifique, Institut de Chimie Biologique, 
•aculte de Medecine, Strasbourg, France 
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A monoclonal antibody, H23, that specifically recognizes a breast-tumor-associated antigen, was used to isolate 
a cDNA insert that codes for the antigenic epitope. Nucleotide sequencing of this cDNA, as well as a longer 
850-bp cDNA insert, shows that they are composed of 60-bp (G -f C)-rich tandem repeating units. The coding 
strand was determined and codes for a proline-rich 20-amino-acid repeat motif. A comparison of the highly 
conserved repeat unit with the deduced flanking amino acid sequences demonstrates conservation of specific 
subregions of the repeat consensus within the flanking amino acids. Hybridization of the 60-bp cDNA probe with 
RN As extracted from a variety of primary and metastatic human tumors yields relatively high levels of hybrid 
with the breast carcinomas, as compared to lower hybrid levels with RNAs from other epithelial tumors. RNA 
extracted from breast tissue adjacent to the tumor or from benign breast tumors, demonstrates low or undetectable 
levels of hybridization. Probing Southern blots with the 60-bp repeat shows that the tumor antigen is highly 
polymorphic and contains a variable number of tandem repeats (VNTRs). The VNTR nature of the gene was 
confirmed by probing Southern blots with unique genomic sequences that are physically linked to an isolated 
gene fragment that also contains the tandem repeat array. Mouse cells transfected with this gene fragment produce 
tumor antigen that is readily detected by H23 monoclonal antibodies. The allelic forms seen in 10 different primary 
human tumors demonstrate 100% concordance with the various mRNA species expressed. These studies are 
extended to the protein forms detected by immunoblot analyses that show both a correlation of the expressed 
tumor antigen species with the allelic forms as well as significantly increased expression in breast cancer tissue. 
The above studies unequivocally establish the over-expression of a VNTR gene coding for an epithelial tumor 
antigen in human breast cancer tissue. 



: ^ The isolation and characterization of proteins that are 
aj&irantly expressed in human tumor tissues may elucidate 
cellular mechanisms leading to malignancy and also be of 
Mgwficant clinical importance. To identify breast-tumor-as- 
sociated markers, we have established a human breast cancer 
cell line, T47D, that has been extensively studied and retains 
Mny characteristics of prjmtfry human breast tumors [1]. 
Monoclonal antibodies (mAb) were prepared against particu- 
late antigens released by these cells and screened for their 
specificity by the immunohistochemical staining of breast tis- 
ste sections. One mAb, designated H23, stained cytoplas- 
niically 91 % of all malignant breast tumors analyzed, whereas 
tittle or no cytoplasmic staining was observed in normal and 
{^ug^east tissues [2]. 

^ Correspondence to D. H. Wreschner, Department of Micro- 
oology, Tel Aviv University, Ramat Aviv, 1-69978, Israel 

: Abbreviations. mAb, monoclonal antibody; H23 Ag, epithelial 
tumor antigen recognized by H23 monoclonal antibodies; ORF, open 
readin g frame; VNTR, variable number of tandem repeats. 

Note. The novel nucleotide sequence data published here and in 

c preceding paper in this journal have been deposited with the 

MBL sequence data bank and are available under the accession 
jj u mber X52228 and X52229. The novel amino acid sequence data 

ave deposited with the EMBL sequence data bank. 



Other groups have also described m Abs reactive with high- 
molecular-mass glycoproteins that are aberrantly expressed 
in epithelial tumors and especially in breast cancer [3—12]. 
Several of these mAbs, namely DF3, HMFG-1, HMFG-2 and 
SM-3, were used to isolate cDNA clones that express the 
immunereactive epitope [13 — 1 5]. The cDNAs isolated all con- 
tain tandem 60-bp repeat units [13 — 15] that code for a 20- 
amino-acid repeat motif rich in proline, serine, threonine and 
alanine [14]. Southern blots probed with the 60-bp repeat 
cDNA insert show that the gene is highly polymorphic and 
correlates with the polymorphism observed in the protein 
products [13, 16]. These results suggest the codominant 
autosomal expression of a gene that contains a variable num- 
ber of tandem repeats (VNTR). 

Besides the 60-bp repeat unit, little has been known regard- 
ing the unique non-repeat sequences of the tumor antigen 
cDNA and its gene. Indeed, the aforementioned studies on 
the molecular structure of the epithelial antigen, including all 
Southern and Northern blot analyses, have been performed 
solely with the 60-bp cDNA repeat unit [13 — 16]. To study 
the breast- tumor- associated antigen recognized by H23 mAbs 
(H23 Ag) on a molecular level, a gtl 1 cDNA expression li- 
brary prepared from T47D mRNA was screened with these 



mAbs. A cDNA insert th; ^Jes for the immunereactive 
epitope was isolated and sequenced. This insert, as well as a 
longer 850-bp cDNA insert, are composed of 60-bp tandem 
repeats, similar to those previously reported [1 3 — 1 6]. We have 
recently increased our knowledge beyond the confines of the 
60-bp repeat units by isolating almost full-length cDN As that 
contain unique non-repeat sequences located 5' and 3' to the 
tandem repeat array and code for the complete epithelial 
tumor antigen [16 a]. 

We have extended these studies and report here the deter- 
mination of the VNTR nature of the gene by analyzing North- 
ern and Southern blots with probes consisting not only of the 
60-bp cDNA repeat [13-16] but also with probes derived 
from unique non-repeat genomic sequences. These investi- 
gations were performed on nucleic acids isolated from primary 
human tissues and are therefore relevant to the in-vivo situ- 
ation. In addition, we demonstrate the over-expression of 
mRNA coding for the tumor antigen and of the antigen itself 
in primary breast cancer tissues. The coding strand of the 
60-bp repeat unit has been determined and a comparative 
analysis of the tumor antigen unique non-repeat amino acid 
sequences with the 20-amino-acid repeat motif is presented. 
Finally, by transfecting cells with the isolated gene coding for 
the tumor antigen, stable mouse cell transfectants have been 
established that express the human-breast-tumor-associated 
antigen. 

The findings reported here unequivocally establish the 
over-expression in human breast cancer tissue of a VNTR 
gene that codes for an epithelial tumor antigen. 
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Southern blot DNA analysis 

High-molecular-mass DNA was isolated from powdered 
surgically removed frozen ( — 70 °C) tissues by incubating 
overnight at 50 °C in 200 ug/ml proteinase K, 100 mM NaCl 
10 mM Tris/HCl pH 7.5, 1 rmVLEDTA followed by phenol/ 
chloroform and one chloroform extraction. The DNA was 
spooled onto glass rods after the addition of 0.2 M NaCl (final 
concentration) and 1 vol. absolute ethanol at — 20°C. The 
spooled DNA was rinsed with 70% ethanol, briefly dried 
resuspended in double distilled water and kept at — 20°C 
The DNA (100 ug/ml final concn) was incubated with the 
appropriate restriction enzymes (approximately 5 units 
enzyme/ug DNA) overnight at 37 °C followed by ethanol pre- 
cipitation at — 20 °C. When double digestions were performed, 
DNA was incubated with one enzyme followed by ethanol 
precipitation, resuspension and then digestion with the second 
enzyme. 10 — 20 ug restricted DNA was electrophoresed on 
0.8% agarose gels in recirculating Tris/acetate/EDTA buffer, 
followed by staining with ethidium bromide and washing in 
1.5 M NaCl, 0.5 M NaOH for 30 min. Southern transfer to 
nylon membranes (Amersham, England) was performed in 
1.5 M NaCl, 0.25 M NaOH. The blot was irradiated with 
ultraviolet light for 3 min, followed by baking at 80°C for 2 h. 
Prehybridization, hybridization and washing were as de- 
scribed above. 



MATERIALS AND METHODS 

Plating of the recombinant cDNA phage and library screening 
with H23 mAb 

The randomly primed Agtl 1 cDNA expression library [17] 
was prepared from poly(A)-rich RNA of T47D cells, a human 
breast carcinoma cell line [2]. Approximately 10 6 phages were 
plated on Escherichia coli strain Y1090 and the resulting 
plaques were screened for expression of crossreacting galacto- 
sidase fusion protein, with 25 ug/ml H23 IgG as described 
elsewhere [18]. For the final detection of positive plaques, 1 2 5 I- 
protein A was used at a final concentration of 4 x 10 5 cpm/mL 
Positive plaques were picked and rescreened repeatedly until 
all plaques were immunopositive. Most of the X cDNA clones 
contained an insert of similar size and the clone with the 
longest insert, designated 3b, was thus obtained and used for 
Northern hybridizatioif assays. 



RNA analysis 

RNA was extracted from surgically removed frozen tissues 
using the guanidinium thiocyanate/cesium chloride method 
[20]. Poly(A)-rich RNA was purified by oligo(dT)-cellulose 
chromatography [21]. For dot-blot analysis, 15 jig of each 
sample of total RNA was applied with gentle vacuum in 200 \i\ 
of 2 x NaCl/Cit to a Gelman nylon membrane using the BRL 
dot-blot apparatus. The RNA samples were covalently at- 
tached to the nylon membrane by ultraviolet irradiation fol- 
lowed by baking at 80 °C under vacuum. 

For Northern analysis 40 ug of each total RNA sample or 
4 ug of poly(A)-rich selected RNA were subjected to electro- 
phoresis on a 1 .4% agarose gel under glyoxal/dimethylsulf- 
oxide-denaturing conditions using Tris/acetate/EDTA as the 
running buffer. Subsequent to 50 mM NaOH treatment and 
washings in 2 xNaCl/Cit, the gels were stained by ethidium 
bromide and Northern blotted to Gelman nylon membranes 
[21]. 



DNA hybridization of cDNA library 

The cDNA library replica-plated on nylon membranes 
(Amersham, England) was probed with cDNA inserts labelled 
by nick translation [19] to a specific activity 2 - 5 x 10 8 cpm/ 
ug and a final concentration of 1 - 2 x 10 6 Cerenkov cpm/ml. 
The replica blots were prehybridized and probed at 42 °C 
for 15 h in 50% formamide, 5 x NaCl/Cit (lxNaCl/Cit is 
150 mM NaCl, 15mM sodium citrate, pH 7.0), 0.1% poly- 
vinylpyrrolidone, 0.1% Ficoll, 0.2% SDS and 100 ug/ml de- 
natured salmon sperm DNA. 

Following hybridization, the blots were washed at 65 °C 
for 2-4 h with several changes of 2 x NaCl/Cit, 0.2% SDS 
following by stringent washing at 65 °C (2x30 min) with 
0.2 x NaCl/Cit, 0.5% SDS. The washed blots were exposed to 



Northern and RNA dot-blot hybridizations 

The blots obtained as previously described, were prehy- 
bridized and probed at 42 °C for 16 h in 50% formamide, 
5 x NaCl/Cit, 0.1% polyvinylpyrrolidone, 0.1% Ficoll, 0.2% 
SDS and 100 ug/ml denatured salmon sperm DNA with 
cDNA inserts labelled by nick translation [19] to a specific 
activity of 2 — 5 x 10 8 cpm/ug. A final concentration of 1 - 
2 x 10 6 Cerenkov cpm/ml was used. Following hybridization, 
the blots were washed at 65 °C for 2-4 h with several changes 
of 2 x NaCl/Cit, 0.2% SDS followed by stringent washing at 
65 °C (2 x 30 min) with 0.2 x NaCl/Cit, 0.5% SDS. Quantifi- 
cation of the hybridization intensity was performed with the 
LKB 2222-020 Ultrascan XL II laser densitometer. Bound 
probes were removed by washing blots in hybridization buffer 
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W^flstruction of eukaryotic expression vector coding for H23 Ag 
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The Xmnl — EcoRl genomic fragment (see Fig. 6) was in- 
cited into the eukaryotic expression vector pCL642 (this 
.fetor will be described in detail in a separate publication). 
Itnefty pCL642 is composed of the promoter region (1 .4 kb) 
Isolated from the mouse housekeeping gene coding for 3- 
fevdroxy-3-rnethylglutaryl-coenzyme-A reductase. The pro- 
jfcjoter is followed by the untranslated first exon and intron 
?ifl 7 kb and 3.5 kb) derived also from this reductase gene. The 
site of the Xmnl — EcoRl genomic fragment coding for 
f the tumor antigen was blunt-end-ligated to an EcoRV site 
treated m a polylinker immediately downstream to the re- 
ductase intron. The EcoRl site was ligated to the Kpnl site of 
hhe polylinker via an EcoRl — Kpnl adaptor. A 123-bp 
S fragment containing the SV40 poly(A) signal sequence is 
| situated immediately 3' to the polylinker. The construct 
l"pCL642/ Xmnl -EcoRl (10 ug) was con transfected with 1 \ig 
l;pAG60(G418R) plasmid [22] into either MM5tC3H cells 
I (from American Tissue Culture Collection) or FR3T3 ras-1 
§ bells [23] using a modification [24] of the calcium phosphate 
I precipitation method [25]. Cells were selected for G418 resis- 
tance (Geniticin 500 ng/ml) and loci were picked and subcul- 
tured. 

For the detection of tumor antigen, the transfected cells 
H were grown on coverslips and immunohistochemically stained 
with H23 mAbs. Control cells were either transfected with the 
pAG60 plasmid alone or with an irrelevant gene. 



Nucleotide sequencing 

Sequencing was accomplished using the dideoxy nucleotide 
chain-termination method [26]. Restriction fragments of the 
cDNA inserts were subcloned into Ml 3 and both strands 
were sequenced. The ssDNA was primed with either the Ml 3 
universal primer or synthetic oligonucleotides prepared ac- 
cording to known sequences. The analysis of the sequence was 
performed using the Beckman MicroGenie program. 



Radioactive labelling of DNA probes 

Double-stranded DNA probes were radioactively labelled 
with [a- 32 P]dCTP either by nick translation or random 
oligonucleotide multiprirning- using commercially available 
kits (BRL, USA, and Amersham, England, respectively). All 
DNA probes used here were purified inserts that were isolated 
by agarose gel electrophoresis. Single-stranded oligonucleo- 
tides were 5'-end labelled by incubating with [y- 32 P]ATP and 
polynucleotide kinase. All labelled probes were purified from 
non-incorporated nucleotide by passage through Sephadex G- 
100 columns. 



Oligonucleotide synthesis 

Oligonucleotides were prepared at the Macromolecule 
Synthesis Service Unit (Department of Organic Chemistry, 
Weizman Institute of Science) by Dr Ora Goldberg using an 
Automated Applied Biosystems synthesizer. Following syn- 
thesis, the oligonucleotides were electrophoretically purified 
°n acrylamide/urea gels. 



Immunoblotting 

Protein samples denatured by boiling in SDS buffer con- 
taining mercaptoethanol were analyzed on 3—15% linear 
gradient SDS/acrylamide gels as previously described [27]. 
Following electrophoresis, the gel was equilibrated in transfer 
buffer (Tris/glycine) and electrotransferred for 3 h at 1 A to 
nitrocellulose filters. The filters were blocked in NaCl/P; 
(150 mM NaCl, 15 mM sodium phosphate, pH 7.0) contain- 
ing 5% skimmed milk (Blotto) followed by incubation with 
antibody in Blotto. The filters were washed in NaCl/Pi and 
then reacted with 125 I-protein A (Amersham, England) dis- 
solved in Blotto. 

Monoclonal antibodies 

Monoclonal antibodies (mAbs) were prepared against par- 
ticulate antigens released into the medium by T47D breast 
carcinoma cells, using established procedures. The mono- 
clonal antibodies obtained were screened against paraffin- 
embedded sections of benign and malignant breast tissue with 
the immunoperoxidase-staining technique and one of the 
mAbs designated H23 [2] was selected and used in this study. 



RESULTS 

Sequence of cDNA coding for epitope recognized 
by H23 monoclonal antibodies 

The gtl 1 cDN A expression libraries prepared with mRNA 
isolated from either T47D or MCF7 [17], both human breast 
carcinoma cell lines, were probed with the monoclonal anti- 
body H23. Libraries obtained by priming poly(A)-rich RNA 
with oligo(dT), as well as with random nucleotide oligomers, 
were investigated. Recombinant clones immunoreactive with 
H23, were obtained at a frequency of approximately 1 in 2000 
in the amplified libraries and the cDNA inserts of all clones 
analyzed revealed a size of approximately 220 — 240 bp. Both 
random oligomer-primed as well as oligo(dT)-primed libraries 
from the MCF7 and T47D cell lines revealed similar-sized 
inserts. 

Nucleotide sequencing of one such representative cDN A 
insert, termed 3b, indicates that it is (G + C)-rich with strand 
preference for the G or C nucleotides. Inspection of the 225-bp 
sequence shows that it is composed of a 60-nucleotide tandem 
repeat unit which is remarkably conserved with only very few 
substitutions occurring between the different units (Fig. 1 A). 

Longer cDNA inserts contain the same 60-bp 
tandem repeat unit 

In order to obtain longer cDNA clones, the 3b cDNA 
insert was used to reprobe the library by DNA/DNA hybrid- 
ization. Several recombinant phages with longer inserts were 
obtained, the longest of which is approximately 850 bp. 
Nucleotide sequencing of this insert indicated that it is solely 
composed of the same tandem 60-nucleotide repeat unit. Simi- 
larly other longer cDN A inserts obtained by 3b cDNA prob- 
ing of the library are also only composed of the tandem 
repeating unit. 

Restriction enzyme digestion of the isolated 850-bp insert 
with Smal (CCCGGG) completely reduces it to 60-bp frag- 
ments (data not shown) thus indicating that Smal sites appear 
at 60-bp intervals. 

A comparison of the repeat units found in the 3b cDNA 
with those present in the 850-bp insert (Fig: 1 A, B and C) 
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%struction ofeukaryotic expression vector coding for H23 Ag 

tlrhe Xmnl-EcoRl genomic fragment (see Fig. 6) was in- 
I 1 ? int0 the eukaryotic expression vector pCL642 (this 
Wt r will be described in detail in a separate publication). 
iFflv pCL642 is composed of the promoter region (1.4 kb) 
Pi«ted from the mouse housekeeping gene coding for 3- 
si^rnxv-3-methylglutaryl-coenzyme-A reductase. The pro- 
per is followed by the untranslated first exon and intron 
%°7 kb and 3.5 kb) derived also from this reductase gene. The 
itll site of the Xmnl - EcoKl genomic fragment coding for 
Ctmor antigen was blunt-end-ligated to an EcoRV site 
: -w*tcd in a polylinker immediately downstream to the re- 
} S a se intron. The EcoRl site was ligated to the *f I * 
^iVoolylinker via an EcoRl-Kpnl adaptor. A 123-bp 
: Sient containing the SV40 poly(A) signal sequence is 
Sed immediately 3' to the polylinker The construct 
\r\ (AllXmnl-EcoVA (10 jig) was contransfected with 1 \ig 
Ug60(G418R) plasmid [22] into either MM5tC3H cells 
£TmcncA Tissue Culture Collection) or FR3T3 ras-1 
Sis 1231 using a modification [24] of the calcium phosphate 
5 ipitation method [25]. Cells were selected for G418 resis- 
Snce (Geniticin 500 |ig/ml) and loci were picked and subcul- 

tUf For the detection of tumor antigen, the transfected cells 
were grown on coverslips and immunohistochemically stained 
with H23 mAbs. Control cells were either transfected with the 
pAG60 plasmid alone or with an irrelevant gene. 

Nucleotide sequencing 

Sequencing was accomplished using the dideoxynucleotide 
chain-termination method [26]. Restriction fragments of the 
cDNA inserts were subcloned into Ml 3 and both strands 
were sequenced. The ssDNA was primed with either the Ml 3 
universal primer or synthetic oligonucleotides prepared ac- 
cording to known sequences. The analysis of the sequence was 
performed using the Beckman MicroGeme program. 

Radioactive labelling of DNA probes 

Double-stranded DNA probes were radioactively labelled 
with [a- 32 P]dCTP either by nick translation or random 
oligonucleotide multiprimin^ using commercially available 
kits (BRL, USA, and Amersham, England, respectively) All 
DNA probes used here were purified inserts that were isolated 
by agarose gel electrophoresis. Single-stranded oligonucleo- 
tides were 5'-end labelled by incubating with [y- P]ATP ^and 
polynucleotide kinase. All labelled probes were purified from 
non-incorporated nucleotide by passage through Sephadex O- 
100 columns. 



lmmunoblotting 

Protein samples *Xed by boiling in SDS ^con- 
taining mercaptoethanol were analyzed on 3-15 A linear 
gradient SDS/acrylamide gels as prev,ously described [27]. 
Sowing electrophoresis, the gel was equilibrated 1m transfer 
buffer (Tris/glycine) and electrotransferred for 3 h , at A to 
nitroce lulose filters. The filters were blocked in NaCl/P, 
n 50 mM NaCl, 15 mM sodium phosphate, P H 7^0) containi- 
ng 5% skimmed milk (Blotto) followed by .ncubat.on with 
antibody in Blotto. The filters were washed m MaCl/P, and 
then reacted with 125 I-protein A (Amersham, England) dis- 
solved in Blotto. 

Monoclonal antibodies 

Monoclonal antibodies (mAbs) were \? r ^ e \f^ s ^J- 
ticulate antigens released into the medium by T47D breast 
carcinoma cells, using established procedures. Th ^ mono- 
clonal antibodies obtained were screened against paraffin- 
embedded sections of benign and malignant breast tissue with 
the immunoperoxidase-staining technique and one o the 
mAbs designated H23 [2] was selected and used in this study. 



Oligonucleotide synthesis 

Oligonucleotides were prepared at the Macromolecule 
Synthesis Service Unit (Department of Orgamc Chemistry, 
Weizman Institute of Science) by Dr Ora Goldberg using an 
Automated Applied Biosystems synthesizer. Following syn- 
thesis, the oligonucleotides were electrophoretically punlied 
on acrylamide/urea gels. 



RESULTS 

Sequence ofcDNA coding for epitope recognized 
by H23 monoclonal antibodies 

The etll cDNA expression libraries prepared with mRNA 
isolated from either T47D or MCF7 [17], both human breast 
carcinoma cell lines, were probed with the monoclonal anti- 
body H23. Libraries obtained by priming poly(A)-ncn KNA 
with oligo(dT), as well as with random nucleotide oligomers 
were investigated. Recombinant clones immunoreactive wrth 
H23 were obtained at a frequency of approximately 1 in 2000 
in the amplified libraries and the cDNA inserts of all clones 
analyzed revealed a size of approximately 220 - 240 1 bp. Both 
random oligomer-primed as well as oligo(dT)-pnmed hbranes 
from the MCF7 and T47D cell lines revealed similar-sized 

'^Nucleotide sequencing of one such representative cDN A 
insert, termed 3b, indicates that it is (G + C)-nch with strand 
preference for the G or C nucleotides. Inspection of the 225-bp 
Luence shows that it is composed of a 60-nucleotide tandem 
repeat unit which is remarkably conserved with only very few 
substitutions occurring between the different units (Fig. 1 A). 

Longer cDNA inserts contain the same 60-bp 
tandem repeat unit 

In order to obtain longer cDNA clones the s 3b cDNA 
insert was used to reprobe the library by DNA/DNA hybrid- 
ization. Several recombinant phages with longer inserts (were 
obtained, the longest of which is approximately 850 bp. 
Nucleotide sequencing of this insert indicated that ,t is solely 
composed of the same tandem 60-nucleotide repeat unit. Simi- 
larly other longer cDNA inserts obtained by 3b cDNA prob- 
ing of the library are also only composed of the tandem 

^ReSn enzyme digestion of the isolated 850-bp insert 
with Sma\ (CCCGGG) completely reduces it to 60-bp trag- 
ments (data not shown) thus indicating that Smal sites appear 
at 60-bp intervals. ,-niMA 
A comparison of the repeat units found in the 3b cDNA 
with those present in the 850-bp insert (Fig. 1A, B and C) 
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Fig. 1 . Nucleotide sequence of cDNAs that code for the epitope recognized by H23 mAbs and contain 60-bp tandem repeat units. The gtl 1 cDNA 
expression library was screened with H23 mAbs as described in Methods and the cDNA insert (indicated as 3B but referred to in the text as 
3b) obtained from a positive purified recombinant phage was subcloned in Ml 3 vectors in both orientations and sequenced (A). The 3b cDNA 
insert was purified, nick-translated and used to reprobe the library under stringent hybridization conditions as described in Methods. The 
longest cDNA insert ('850', i.e. 850 bp) thus obtained was subcloned in Ml 3 and both strands were partially sequenced (B and C). Only the 
C-rich strand is presented. The consensus sequence of the 60-bp repeat unit is shown at the top of the figure. Nucleotides in the repeat units 
identical to this sequence are indicated with dashes whilst substitutions are shown by an asterisk 
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Fig. 2. Northern blot analysis of human breast tumor RNA samples with 3b cDNA probe and synthetic complementary oligonucleotides 
derived from the repeating unit. RNA was extracted from human breast tumor tissue (lanes 1 and 3) or adjacent 'normal' breast tissue 
(lanes 2 and 4) from two separate individuals (lanes 1 + 2 and 3 + 4) and analyzed by glyoxal agarose gel electrophoresis followed by 
Northern blotting to nylon membranes and hybridization with (A) the 3b cDNA probe, (B) the C-rich oligonucleotide 5' 
AGCCCACGGTGTCACCTCGGCCCCGGACA 3' identical to nucleotides 15-43 of the concensus sequence presented in Fig. 1 A and (C) 
the complementary G-rich oligonucleotide 5' TGTCCGGGGCCGAGGTGACACCGTGGGCT 3'. The probe used in A was radioactively 
labelled by nick translation whilst those used in B and C were end -labelled by polynucleotide kinase and [y- 32 P]ATP as described in Methods. 
The blots were stringently^washed and autoradiographed at — 70°C. The full arrow to the left of the figure indicates the 3.6-kb hybridizing 
mRNA species whilst tfiat on the right points to the 6.0-kb mRNA species detected in the other sample. The open arrow indicates the position 
of 28S rRNA 



demonstrates remarkable conservation with very few nucleo- 
tide substitutions occurring between the repeats. 

Coding strand of the tandem repeat unit 

To determine the coding strand of the tandem repeat unit, 
Northern blots were probed with synthetic oligonucleotides 
complementary to either strand of the repeat unit. Probing a 
Northern blot containing RNA isolated from human breast 
samples (both tumor and adjacent 'normal' tissues) shows 
that the G-rich synthetic oligonucleotide hybridizes to mRNA 
species (Fig. 2). An identical hybridization pattern was ob- 



served when 3b cDNA was used to probe the same blot 
(Fig. 2). The breast tumor tissue has in the one case a hybrid- 
izing mRNA species of 6.5 kb whilst the second sample shows 
a single band at 3.6 kb. The corresponding RNA samples 
from adjacent 'normal' tissue are identically sized but much 
reduced in amount. In contrast to the above results, no hybrid- 
ization at all is seen with the second complementary C-rich 
oligonucleotide (Fig. 2). These findings confirm that RNA 
species containing multiple 60-nucleotide tandem repeats are 
bonafide transcripts. Moreover the orientation of transcrip- 
tion is demonstrated and the C-rich strand of the cDNA insert 
is the coding strand. 
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fig. 3. Comparative analysis of the flanking amino acid sequences with the 20-amino-acid repeat motif The amino acid sequence of the repeat 
motif is presented in the central boxed region and numbered from 1 to 20. The alternative amino acids that occur due to variations in the 
consensus sequence are indicated below the numbers. The 100 amino acids flanking the repeat motif on the amino and carboxyl terminals are 
shown (NH 2 and COOH, respectively). Flanking amino acids that are identical with the repeat motif are boxed in by the full line, whereas 
the flanking amino acids that appear in the same position every 20 amino acids are boxed in by a series of dots. Amino acids that vary from 
the repeat motif and appear at the same positions on either side of the repeat motif are indicated by * and are boxed in 
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Comparative analysis of the flanking amino acid sequences 
with the 20-amino-acid repeat motif 

The determined coding strand of the 60-bp cDNA could 
be translated in all three reading frames. As almost full-length 
gDNAs coding for the H23 Ag have recently been isolated 
[16a], the correct reading frame of the repeat motif could 
be readily identified (Fig. 3). The high level of nucleotide 
conservation amongst the various repeat units is reflected in 
the repeat-unit amino acid sequences (Fig. 3). The studies 
reported here (see also below) show that the tumor antigen 
has the unusual structure of highly conserved repeat units that 
compose at least 50% of the protein molecule. It is therefore 
of considerable interest to compare the similarity of flanking 
non-repeat amino acid. sequences with the 20-amino-acid re- 
peat motif itself. 

Several possibilities may be envisaged, (a) An abrupt break 
may occur in the continuity of the repeat motif and no simi- 
larity exist between the flanking amino acid sequences and the 
repeat units, (b) Some degree of similarity may exist between 
the flanking amino acid sequences and the repeat motif that 
declines with increasing distance away from the repeat array. 
Qr (c) the flanking amino acid sequences may retain similarity 
only with specific amino acids or regions of the consensus 
^peat motif. 

The comparative analysis (Fig. 3) shows that indeed simi- 
larity exists between the flanking amino acid sequences and 
toe repeat motif itself. However, this similarity is confined 
l o specific subregions such as the Val-Thr-Ser and Gly-Ser 
Peptides at residues 11—13 and 2 — 3, respectively, and occurs 
lrj the flanking amino acid sequences on both sides of the 



repeat motif. On the other hand, certain amino acid residues 
are conserved asymmetrically, i.e. either upstream or down- 
stream to the repeat motif. Significant conservation in the 
amino-terminal flanking sequences occurs with the proline 
residues (15 and 19) and the alanine residue (20), whereas 
threonine (4) and alanine (14) are appreciably conserved only 
in the carboxyMerminal flanking sequences. The conservation 
of the proline residue (19) in the upstream flanking sequences 
is particularly remarkable as it is located in the same position 
for 82 amino acids upstream to the repeat motif. Of further 
note are the amino acids that diverge from the repeat motif : 
the asparagine residue that replaces threonine (1 7) does so on 
both sides flanking the repeat motif. Furthermore, proline (1) 
is replaced by leucine and followed 20 amino acids later by 
serine; it is indeed striking that identical changes occur both 
in the upstream and downstream flanking sequences. 

At the present time, the significance of repeat-motif amino 
acid conservation, as well as the identical amino acid changes 
occurring on both sides of the repeat, is not known. They may 
impose certain structural constraints on the protein molecule 
or/and be related to a function involving specific subregions 
of the repeat motif. 

Expression of tumor antigen mRNA in primary human tissues 

The in vivo system studies were extended and we investi- 
gated the presence of mRNA species hybridizing with the 3b 
cDNA probe in a variety of benign and malignant human 
tissues by RNA dot blotting and Northern blot analysis. The 
initial dot-blot screening demonstrates very significant levels 
of hybridizing mRNA species in total RNA prepared from a 
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Fig 4 Levels of RNA species in human tissues hybridizing with the 3b cDNA probe. Total RNA from different human tissues was dot-blotted 
and probed with (a) the 3b cDNA insert and (b) a cDNA insert (unpublished results) coding for part of human 18S "bosomal RN T*e key 
for the dot blots is as follows: (Al -8) BT2, BT4 } BT5, BT6, BN7, BT7, BN9, BN10; (Bl -8) BT10, BN12 BT12 BF13 BT15 BF16, PE1, 
PE2* (CI -8) ST1, ST2, KN1, KN2, KT2, UT1, UNI ; (Dl -8) CT1, CN1, CT2, LN1, LT1, blank, LT2; (El -2) AT, GT1. Abbreviations 
used are for RNA extracted from: BT - breast adenocarcinoma, ST - gastric carcinoma, KT = hypernephroma, UT = transitional cell 
carcinoma, CT = colon adenocarcinoma, LT = lung tumor, AT = pheochromocytoma and GT = ovarian carcinoma. The corresponding 
'N* samples (for example BN) represent RNA isolated from 'normal' tissue adjacent to the tumor. The same numbers indicate preparations 
from the same patient. The BF samples are breast fibroadenomas. PE samples are from pleural effusion metastatic cells of patients with 
advanced breast cancer, (c) The dot blots were scanned by laser densitometry using an LKB laser densitometer and the absorbance values 
obtained with the blot probed with the 3b cDNA probe were divided by the levels observed following 18S cDNA probing This procedure 
resulted in a normalized arbitrary unit corresponding to each sample which is presented on the ordinate of the figure. Total RNA extraction, 
dot blotting, hybridization and washing conditions were as described in Materials and Methods. The blots were exposed to Agfa Cunx X-ray 
films at — 70 °C with a Curix special intensifying screen 



number of breast carcinomas (Fig. 4 A). Several of these 
samples contain large quantities of mRNA capable of hybrid- 
izing with the 3b cDN A probe and a quantitative analysis 
demonstrates high levels of mRNA hybridization to the 3b 
cDNA probe (Fig. 4C). Significantly lower levels of hybridiza- 
tion are observed in RNA isolated from non-malignant breast 
tissue adjacent to the biopsied tumor sample. For example, 
Fig. 4 A shows that RNA isolated from tissue adjacent to 
tumor in samples BN7 (B = breast, N = normal), BN10 
and BN12 (dot-blot positions A5, A8 and B2, respectively) 
demonstrate hybridizing values of 5.5, 5.1 and 1 .8 (normalized 
probe-specific hybridization) whereas 3b cDN A hybridization 
to RNA extracted from the corresponding tumor samples BT7 
(B = breast, T = tumor), BTlO and BT1 2 (dot-blot positions 
A6, Bl and B3, respectively) shows considerably higher values 
of 53.0, 16.8 and 41.1 respectively). Two breast fibroadenomas 
(dot-blot positions B4 and B6) contain very low levels of 
hybridizing RNA. Of all benign breast tissues analyzed to 
date, only one sample that was pathologically classified as 
nonmalignant (BN9-A7 on the dot blot) contains significant 
levels of 3b cDN A hybridizing RNA. Interestingly, this sample 
was obtained from the second breast of a breast cancer patient 
who had undergone mastectomy several years earlier. In con- 
trast, mostly low, albeit detectable, levels of hybridization 
to the 3b cDNA probe are present in RNA extracted from 
stomach, colon and lung adenocarcinomas, as well as 
hypernephroma. Extremely low levels are seen in RNA iso- 
lated from a bladder carcinoma and undetectable levels of 



hybridization occurred with RNA from an adrenal pheo- 
chromocytoma, as well as in RNA extracted from chronic 
lymphocytic leukemic cells or from a brain neuroblastoma 
sample (data not shown). 

Analyses of human tumor RNA species by Northern blotting 

In order to determine by an independent method the va- 
lidity of the dot-blot analysis, the human-tissue RNAs were 
analyzed by probing Northern blots with the 3b cDNA probe 
(Fig. 5). Differences in both the sizes of the hybridizing 
mRNA species as well as in the relative levels are immediately 
evident. The relative levels obtained in the Northern blot 
analysis correlate well with those seen in the initial dot-blot 
screening. The most intensive hybridization is observed with 
RNA extracted from the breast tumor BT15 which yields a 
prominent RNA band located at the 6.5-kb position along 
with a significantly weaker band at approximately 3.6 kb 
(Fig. 5B, lane 7). A densitometry analysis indicates that hy- 
bridization in this breast-tumor RNA sample is 30-40-fold 
higher than that observed in other RNA samples analyzed on 
the same Northern blot. Much lower levels of hybridization 
of a 3.6-kb species are seen in RNA isolated from the two 
pleural effusions (lanes 5 and 9) which were revealed by longer 
exposure of the autoradiograph (data not shown). 

RNA extracted from the breast fibroadenoma (lane 6, 
Fig. 5 B) demonstrates only very low levels of hybridization 
with a 3.0-kb RNA species, whereas barely detectable levels 
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c : < Northern blot analysis of RNA species hybridizing with 3b cDNA. RNA samples isolated from a human hypernephroma KT1 1 (lane ) 
^ LVkidney non-malignant tissue KN1 (lane 2), thyroid nodular goiter (lane 3), thyroid follicular adenoma and lymphocytic thyroiditis 
* dj ^ 3b "from the pleural effusions of two patients with advanced breast carcinoma PE1 and PE2 (lanes 5 9) b«st r.brc^e^_BF16 
f D 2' brea adenocarcinoma BT15 (lane 7) and brain neuroblastoma (lane 8) were analyzed by agarose gel electrophoresis and eth.d.um 
° an h; \ ainine (AHolkwed by Northern blotting and probing with the 3b cDNA insert. (B) The autorad.ograph of 1 -day exposure is 
Tntd SR^State sarnies Z ana.yzed by Northern blotting and probing with the 3b cDNA insert as described 
Serials and Methods. All washings were performed under stringent conditions (0.2 x NaCl/C.t, twice for 30 mm at 60 C) 
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of hybridization at the 3.6-kb position are seen in one of the 
thyroid samples (lane 4, longer exposure, data not shown). 
Hybridization is not detected with RNA from a neuroblas- 
toma, one thyroid sample and non-malignant tissue of the 
hypernephroma (lanes 8, 3 and 2, respectively). 

The gene hybridizing with the repeat unit 

is polymorphic and is a VNTR gene: 

verification with unique non-repeat genomic sequences 

The presence of the 60-nucleotide tandem repeat unit in 
the cDNAs analyzed indicates that the gene coding for this 
protein probably also contains a variable number of tandem 
repeats and thus belongs to the class known as VNTR genes. 
In order to demonstrate the polymorphism occurring in such 
a gene, a Southern blot comprising EcoRl and EcoRl/Pstl 
double-digested DNA was prepared from a number of human 
tissue samples isolated from different individuals. Hybridiza- 
tion with either the 3b cDNA insert or with the larger 850-bp 
cDNA insert (previously described above) shows marked gene 
polymorphism with at least 11 different alleles evident in the 
9 samples studied (Fig. 6). Although the allelic patterns are 
similar on the EcoRl or double-digested EcoRl/Pstl DNA 
samples, the sizes of the different alleles following the double 
digestion are significantly smaller, thus increasing their 
electrophoretic resolution (Fig. 6). 

From the Southern blotand cDNA nucleotide sequencing 
data presented, it is concluded that (a) the different alleles 
result from differences in the number of repeat units, (b) the 
EcoRl and Pstl sites are situated outside the tandem repeat 
unit and (c) the Pstl sites are closer to the borders of the 
tandem repeat units than are the EcoRl sites. 

Polymorphism of this gene has also recently been described 
by two other groups [13-16]. However, the only probe used 
in the reported studies has been the 60-bp repeat unit [13- 
16]. Conclusive evidence that the gene is in fact an expressed 
VNTR gene requires probing of both Northern and Southern 
blots also with unique non-repeat sequences that are linked 
to the repeat array. 

We further verified the VNTR nature of the gene by re- 
probing the same Southern blot with a non-repeat DNA frag- 
ment excised from the cloned 7.5-kb EcoRl - EcoRl gene 
fragment, isolated from a genomic library by probing with the 



cDNA 60-bp repeat unit [34]. This fragment (a Smal-Pstl 
fragment, see Fig. 6), is approximately 1 kb and is situated 5' 
to the array of tandem repeat units. It should thus hybridise 
with a single identically sized DNA band in all samples that * 
have been EcoRl/Pstl double-digested. On the other hand, 
hybridization of this same fragment with an EcoRl -digested 
DNA should yield an identical hybridization pattern to that 
seen following hybridization with the repeat unit. These pre- 
dictions were confirmed by the results obtained. Hybridization 
of the EcoRl/ Pstl-digested samples with the 1-kb non-repeat 
fragment reveals a 3.5-kb band in all samples investigated 
(Fig. 6B, EcoRl + Pstl). This band is absent when the blot is 
probed with the repeat unit (compare Fig. 6 A and B 
EcoRl + Pstl digest). The lightly labelled additional bands 
designated by asterisks in Fig. 6B are the remnants of the first 
hybridization with the repeat unit (compare with Fig. 6 A, 
EcoRl + Pstl). Furthermore, hybridization of the EcoRl di- 
gest with the non-repeat fragment or with the repeat unit are 
identical, as predicted (compare Fig. 6A and B EcoRl). 

As expected, the larger allele that contains more repeat 
units than the smaller allele shows a stronger signal following 
hybridization with the repeat unit probe (see Fig. 6, lane 2 for 
example). 

The above data present evidence that the gene coding for 
the tumor-associated antigen is indeed a VNTR gene. 

The different alleles are codominantly transcribed 
into corresponding mRNA species 

We had previously seen significant heterogeneity in mRNA 
species that hybridize with 3b cDNA expressed in tumor 
samples isolated from different individuals. Two, and less 
often, only one hybridizing RNA band(s) are observed in any 
individual sample. As the gene itself is highly polymorphic, we 
investigated whether a correlation exists between the different 
allelic forms and the number and sizes of hybridizing RN As 
expressed (Fig. 7). 

Although, as noted above, 3b-hybridizing mRNAs are 
highly over-expressed in most malignant breast tissues, RNA 
isolated from other epithelial tumors also demonstrate hybrid- 
izing mRNA species albeit at lower levels. In order to establish 
the scope of this possible allele/mRNA correlation, investi- 
gations were performed both on non-breast and breast tumor 
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samples. Two breast tumor samples that express the lowest 
levels of 3b-hybridizing mRNA out of all malignant breast 
tissues analyzed were selected for comparison. In this regard 
it should be emphasized that the investigation was performed 
on nucleic acids isolated from primary human tissues rather 
than from cell culture lines. The conclusions of these exper- 
iments are therefore relevant to the in-vivo situation. In 10 
out of 10 primary human tumor samples investigated, full 
concordance is demonstrated between the number and sizes 
of alleles with the corresponding hybridizing mRNA species 
(Fig. 7 A and B). 

The different allelic forms probably vary due to a differ- 
ence in the number of tandem repeats. We thus investigated 



whether the corresponding mRNA species expressed in the 
same individual demonstrate an identical size difference. As 
the homozygotic breast tumor samples correspondingly ex- 
press one mRNA species, they were not included in this analy- 
sis. The results shown in Fig. 7 indicate that, within the accu- 
racy possible for DNA fragment and mRNA species size 
determination, the allelic size difference for the heterozygotic 
samples is equal to the difference in size of the two mRNA 
species. 

It is interesting to note that the mRNA species correlating 
with the larger allele gives a less intense hybridization signal 
than the smaller mRNA species (see Fig. 7, lanes 2, 3 and 8). 
We do not know whether this is a consequence of reduced 
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j?j g 7. Correlation of Southern and Northern blots containing DNA 
tiid RNA isolated from the same human tissue sample following hybrid- 
ization with the repeating unit. DNA (A) or total RNA (B) was isolated 
from the foDowing human tissues that had malignant tumors: stomach 
(lane 1), ovary (lanes 2 and 3), lung (lane 4), breast (lanes 5 and 7) 
and thyroid (lanes 8—10). The DNA after double digestion with 
EcoRl and Pstl or total RNA samples were electrophoresed on 
agarose gels and Southern (A) or Northern (B) blotted followed by 
hybridization with radioactively labelled 3b cDNA probe. The blots 
were stringently washed and autoradiographed at — 70 °C. All samples 
in A were run simultaneously on the same gel but lane 8 was exposed 
for a longer time as less DNA was available for analysis. On the 
Northern blot, samples 2 — 7 were run simultaneously on the same gel 
'W lane 6 was exposed for a longer time as there was significantly 
less mRNA expression in this sample. Samples 1 and samples 8 — 10 
were run on two separate gels. The positions of the hybridizing mRN A 
species are indicated in B by the upward or downward facing full 
arrows. Note that in lanes 2 — 7 (and especially in lane 6) some 
nonspecific hybridization with 28S rRNA has occurred 



transcription of the larger allele, reduced stability of the larger 
mRN A species or other mechanisms. 

In order to characterize further the correlation of allelic 
forms with the different mRNA species, both a Southern and 
Northern blot were rehybridized with the 1-kb non-repeat 
genomic fragment described above (Fig. 8 A and B). As ex- 
pected, probing the Northern blot with either the 3b cDNA 
tandem repeat units or with the 1-kb non-repeat fragment 
(Fig. 8B, lanes 1 and 2 respectively) reveals identical hybrid- 
izing mRNA species. On the other hand, reprobing the 
Southern blot with the 1-kb non-repeat fragment demon- 
strates only one band in contrast to the two allelic forms seen 
following probing with the repeat units (Fig. 8 A, lanes 2 and 
1 respectively). 

Expression of H23 Ag in cells stably transfected 
*ith the H23 Ag gene 

By probing Northern and Southern blots with both unique 
genomic sequences and the 60-bp repeat unit, we demon- 
strated the expression of a VNTR gene that codes for the 
H23 Ag. These critical experiments hinge on the physical link- 
i a ge, in the genomic fragment isolated, of unique non-repeat 
j E>NA sequences with the tandem repeat array. 




Fig. 8. Probing Southern and Northern blots with the repeating-unit 
cDNA probe and a non-repeat genomic fragment. High-molecular-mass 
DNA (A) or total RNA (B) was isolated from human lung tissue. The 
DNA, following EcoKl plus Pstl double digestion, and total RNA 
were electrophoresed on agarose gels and Southern (A) or Northern 
blotted (B). The blots were hybridized with the 3b repeating-unit 
cDNA probe (lane 1) or, after stripping, with the 1-kb non-repeat 
fragment of the gene (lane 2, see text). The blots were stringently 
washed and autoradiographed at — 70 °C. The full arrows indicate 
bands hybridizing with the 1-kb non-repeat genomic fragment whilst 
the open arrows show bands hybridizing to the 3b repeat-unit cDNA 
probe. The efficiency of stripping the Northern blot following the 
first hybridization with 3b cDNA (B, lane 1) was evaluated by blot 
autoradiography prior to the second hybridization: no signal at all 
was seen. The bands appearing in B, lane 2, are thus bona fide signals 



In order to confirm this linkage, mouse or rat cells were 
transfected with the isolated genomic fragment and then 
analyzed for H 23 Ag synthesis. We had previously determined 
by cDNA and genomic sequencing (unpublished results) that 
an Xmnl site is located 35 nucleotides upstream to the putative 
ATG initiation codon of the H23 Ag gene. The Xmnl — EcoRl 
gene fragment (see Fig. 6) was therefore isolated and inserted 
into a eukaryotic expression vector downstream to the pro- 
moter of a housekeeping gene, 3-hydroxy-3-methylglutaryl- 
coenzyme-A reductase. In order to obtain stable transfectants, 
the H23 Ag gene construct, pCL642/ Xmn-Eco, was cotrans- 
fected into mouse mammary tumor cells MM 5, with a plasmid 
coding for resistance to the antibiotic neomycin^ Similar trans- 
fections were conducted with c-Ha-ras-transformed rat fibro- 
blasts. Neither of these cell fines expressed any human epi- 
thelial tumor antigen detectable with H23 mAb. As a control, 
MM5 cells were separately stably transfected with a pCL642 
construct containing an irrelevant gene. (Details on the 
pCL642 eukaryotic expression vector are to be presented in a 
separate publication.) 

Both the MM5 and rat fibroblast stable transfectants were 
grown on coverslips and immunohistochemically stained with 
H23 mAb (Fig. 9). Whereas no staining is observed in control 
MM 5 and rat fibroblasts transfected with the non-relevant 
gene (Fig. 9 A' and B'), stable transfectants harboring the 
pCL642/ Xmn-Eco construct demonstrate intense staining, 
readily detected with the H23 mAb (Fig. 9 A and B). Staining 
is mainly cytoplasmic and is undetectable within the cell nu- 
cleus. 

Western blot analyses of cell proteins from the pCL642/ 
Xmn-Eco transfection demonstrate high-molecular-mass pro- 
teins (five bands ranging from 70 kDa to > 200 kDa) that 
are immunoreactive with H23 mAb (data not shown). These 
protein species are likely to represent H23 Ag glycosylated 
to varying degrees, thereby producing heterogenously sized 
immunoreactive products. Cell extracts from cells transfected 
with the non-relevant gene show no immunoreactive bands 
on the Western blot analysis. 
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Over-expression of the H23 Ag 

in primary human breast tumor tissue: Western blot analysis 

Having established (a) that in primary human tissues the 
gene polymorphism directly correlates with the mRNA species 
expressed and (b) that the mRNA coding for the antigen is 
over-expressed in breast tumor tissue, we next investigated the 
expression of antigen at the protein level 

A preliminary investigation was conducted on the human 
breast cell line T47D, which expresses large amounts of tumor 
antigen that are readily detectable by H23 mAbs. These cells 
were analysed at the gene, mRNA and protein levels. 

A Southern blot of an EcoRl/Pstl digest shows two allelic 
forms at 5.5 and 3.1 kb (Fig. 10 A, lane s). The Northern blot 
analysis correspondingly demonstrates two mRNA species 
(6.5 and 4.1 kb) that hybridize with the repeat-unit cDNA 
probe (data not shown). The protein products of these mRNA 
species were analyzed Uy immunoblotting which shows two 
products migrating on SDS-denaturing gels with molecular 
masses in the region of 250-450 kDa (Fig. 10A, lane w). No 
bands are observed when the immunoblot was probed with a 
non-specific monoclonal antibody under identical conditions. 
The alleles of the T47D gene are thus transcribed into mRNA 
species that are subsequently translated into distinct high- 
molecular-mass protein products that correlate with the re- 
spective mRNA and allelic sizes. 

In order to relate the above findings to an in-vivo system, 
these studies were extended, as with the RNA and DNA 
analyses, to primary human tissues. Extracts of human tissue 
samples were run on SDS-denaturing gels and the separated 
protein species immunoblotted and probed with H23 mAbs. 
Analyses were performed on malignant breast tumor tissue 
samples together with an extract from non-malignant breast 
tissue adjacent to the biopsied tumor sample. 



Fig 10. Correlation of alleles hybridizing to the repeat unit ^protein 
products detected by H23 monoclonal antibodies and H23 Ag 
overexpression in breast cancer tissue. DNA, double-dusted with 
EcoKl and Psil, or RNA isolated from T47D breast carcinoma cells 
were analyzed by agarose gel electrophoresis and Southern (A, s) or 
Northern blotted (not shown). The blots were hybridized to the : JD 
cDNA repeating unit probe, stringently washed and autoradio- 
graphed. For immunoblotting (A, B and C, w), samples were boiled 
for 3 min in SDS/mercaptoethanol sample buffer, and electrophoreses 
on a 4-15% SDS gradient gel, followed by electro-transter to a 
nitrocellulose membrane as in Methods, and reacted with H23 mftD. 
Sample A, iane w is the medium of T47D cells precipitated with 
ammonium sulfate; samples B and C are the protein ^^^/^ 
breast cancer tissue (lane 2) and the adjacent non-malignant oreasi 
tissue (lane 1). DNA extracted from the same samples (B and C, iane 
s) was restricted with EcoKl alone, Southern blotted and probed ww 
3b cDNA 

Probing the immunoblots with H23 mAbs demonstrates 
marked over-expression of the tumor antigen in the mahgnan 
breast tissue samples (Fig. 10B and C, lane 2). The non-malig- 



rm^m 



kheir malignant 
lune reactivity with 



wotein 
m Ag 
i with 
ta cells 
s) or 
the 3b 
iradio- 
boiled 
oresed 
;r to a 
. mAb. 
h 50% 
s from 
breast 
;, lanes 
;d with 



strates 
ignant 
malig- 



Ireast tissues, that were adjac 
Prparts, show significantly lower im 
§23 mAbs (Fig. 10B and C, lane 1). 
Jte previously shown for T47D, the polymorphism of the 
l^mmunoreactive protein species seen in the primary hu- 
"fbreast tissue samples correlates with the different allelic 
.^ observed in Southern blots probed with the 60-bp repeat 
|fc{Fig- and C, lane s). 



|cUSSION 

|l,The results presented here show that a highly polymorphic 
: contains a 60-bp tandem repeat array and codes for an 
pithelial tumor antigen that is over-expressed in human 
figp&st cancer. The H23 monoclonal antibody recognizes an 
feitope contained within the 20-amino-acid repeat motif 
lehcoded by the 60-bp cDNA and detects intracytoplasmic 
latitigen in 91 % of malignant breast tumors [2]. An almost 
Identical 60-bp cDNA insert has been isolated by two other 
Igroups [13 — 16] using monoclonal antibodies (DF3, HMFG- 
VlV HMFG-2 and SM3) that also recognize high-molecular- 
^.mass mucin-like glycoproteins aberrantly expressed in breast 
r cancer tissue. It seems likely that different post-translational 
modifications occur within the 20-amino-acid repeat motif, 
: encoded by the 60-bp cDNA, thus explaining, in part, the 
varying specificities of the different mAbs for normal and 
malignant breast tissue. 



The gene coding for tumor antigen 

As previously reported [13 — 16], the gene coding for the 
tumor antigen contains a variable number of tandem repeats 
and is highly polymorphic. We have extended this finding and 
probed restricted genomic DNA samples with unique non- 
- repeat sequences isolated from a genomic fragment that con- 
tains the tandem repeat array. This analysis demonstrates that, 
external to the repeat array, the gene does not exhibit any 
heterogeneity, thereby indicating that the genetic polymor- 
phism is solely due to varying numbers of the 60-bp tandem 
repeats. It is also demonstrated here that, besides expression 
of the 60-bp repeat units, unique non-repeat genomic se- 
quences are expressed into mRNA and translated into protein. 

The physical linkage of unique non-repeat sequences with 
the expressed 60-bp tandem repeat array was further con- 
firmed by transfection experiments. The isolated gene frag- 
ment, from which the unique repeat sequences are derived, 
was transfected into mouse and rat cells that do not normally 
express any tumor antigen detectable with H23 mAb. The 
transfectants thus obtained synthesize human tumor antigen 
that is readily detected by H23 mAb. Furthermore, these 
transfection studies provide strong evidence that the isolated 
cDNA and gene fragment are indeed bona fide sequences that 
eode for the human epithelial tumor antigen. 

\ Correlation of alleles with expressed mRNA species 
°nd protein products: studies with primary human tissues 

In a recent study involving only material derived from cells 
grown in culture [13], the gene polymorphism was found to 
correlate with both the mRNA species and protein forms 
detected. The different protein species observed in human 
urine by immunoblot analysis [16] also correlate with the 
^ various alleles. To our knowledge, there have been no reports 
/ demonstrating a concordance between the various alleles, 



mRNA species and p; ^B)rms in primary human tissues. 
We show here that in pninlrry human tumors full concordance 
exists between the alleles and the transcribed mRNA species. 
This is demonstrated for nucleic acids extracted from breast, 
ovary, lung, stomach and thyroid tissues. Furthermore, it is 
shown that the allelic and mRNA size differences are 
equivalent in every sample of primary human tissue analyzed. 
These studies indicate that the heterogeneity in mRNA species 
is also solely due to the number of tandem repeats that they 
contain. 

The correlative study of alleles and mRNA species in the 
same samples allows us to determine that approximately 
1 .9 kb in any individual mRNA species is represented by non- 
repeat sequences. The coding capacity of the tandem array is 
thus probably greater than 50% of the total protein, even in 
the smallest mRNA observed, and could code for more than 
65% in the larger mRNA species. 

Analyses of RNA samples from primary benign and malig- 
nant tumors demonstrate undetectable levels of hybridization 
in tissues of nonepitheliai origin, whereas several non-mam- 
mary epithelial adenocarcinoma tumors display low levels of 
hybridization. However, RNA extracted from three ovarian 
carcinomas shows significant levels of hybridization with the 
3b cDNA (an example of the intensity of hybridization is 
shown in Fig. 4 A, dot-blot position E2). A question of obvi- 
ous interest is whether this expression is due to the endocrine 
nature of these tissues. 

The highest levels of hybridizing mRNA species are detect- 
ed in malignant breast tumors. Non-malignant 'normal' tissue 
adjacent to the breast tumor samples, as well as non-malignant 
breast fibroadenomas, display much lower hybridization lev- 
els. The increased expression of the mRNA species hybridizing 
with the 3b cDNA probe thus strongly correlates with the 
malignant phenotype of the breast tissue. Although the mech- 
anisms involved in the increased expression are not known, 
they may be related to the de-differentiated state of malignant 
tissue. 

Since H23 mAbs detect an intracellular antigen primarily 
in breast tumor sections [2], the detection of hybridizipg RNA 
from non-breast tumors with the 3b cDNA probe,, albeit at 
low levels, is surprising. We have recently isolated unique- 
sequence cDN A that account for almost full-length cDNA of 
the tumor antigen [16a]. As several different alternatively 
spliced cDNAs were characterized, it is possible that the loss 
of epitope recognition by H23 mAb may be due to alternative 
splicing of the mRNA species in non-breast tissues. Other 
possibilities to be considered are different translational frames 
or simply a question of sensitivity of the immunohistochemical 
staining technique. 

The expression of the gene coding for the tumor antigen 
was also investigated at the protein level. This study shows 
that in primary human breast tumor tissue the polymorphism 
detected in the gene and mRNA species correlates with the 
protein products detected by immunoblotting. Moreover, it 
is quite obvious that the malignant breast tissue contains 
significantly higher levels of tumor antigen than adjacent nor- 
mal breast tissue. 

We and others [12] have described a 68-kDa protein species 
that can be precipitated with mAbs directed against this epi- 
thelial tumor antigen. The 68-kDa protein is not detected, 
however, using the conditions of the immunoblot technique 
described here. It may represent a partially glycosylated pro- 
tein or alternatively a breakdown product (induced proteo- 
lytically or otherwise), that contains a discrete number of 



repeat motifs. These possibilities are presently being investi- 
gated. 

The 20-amino-acid repeat motif : 
comparison with flanking amino acids 

The epithelial tumor antigen is composed of 20-amino- 
acid repeats that make up more than 50% of the total protein. 
This unusual structure of highly conserved repeat motifs has 
recently been documented for porcine submaxillary gland mu- 
cin [28], human intestinal mucin [29], a cell-surface antigen 
expressed by murine hemopoietic progenitor cells [30], human 
apolipoprotein (a) [31], apo-polysialoglycoprotein of rainbow 
trout eggs [32] and a repetitive protein from Xenopus laevis 
skin [33]. It is interesting to note that repeat elements from 
several of these proteins are also rich in serine and threonine 
residues. The function of the 20-amino-acid repeat motif, as 
of the complete epithelial tumor antigen itself, is unknown. It 
is striking, however, that specific amino acids and subregions 
of the repeat element are conserved in the flanking regions on 
both sides of the repeat array. Furthermore, in some cases, 
identical amino acids replace a repeat motif amino acid in the 
same position on both sides of the repeat array. Although we 
do not understand the significance of this gradual decline in 
similarity between flanking amino acids and the repeat motif, 
it may indicate that a specific function is related to a certain 
amino acid sequence of the repeat motif. 

Mouse cell transfectants synthesize the human epithelial tumor 
antigen : possible insights into tumor antigen function 

Mouse cells transfected with the isolated gene coding for 
the human epithelial tumor antigen synthesize protein readily 
detected with H23 mAb. The location of synthesized antigen 
is primarily cytoplasmic, although we cannot rule out the 
possibility that it may be bound to the endoplasmic reticulum 
and/or plasma membrane. Recent analyses of full-length 
cDNAs [16 a] show that differential splicing events occur 5' 
and 3' to the tandem repeat array giving mRNAs that will 
produce several forms of the antigen that localize to different 
cellular compartments. 

Using the transfected cells as a model system, we are now 
in a position to ask questions regarding the function of this 
epithelial tumor antigen: does it change the growth character- 
istics, morphology or/and tumorgenicity of the transfected 
cells? Preliminary resultsindicate that expression of the tumor 
antigen indeed changes the cell growth potential; these and 
other possible functions are presently being investigated. 

This work was supported by the Simko Chair for Breast Cancer 
Research, Frederico Fund for Tel Aviv University, grants from Center 
National Recherche Scientiflque, Mr Toby Green (London), Maurice 
and Frida Eskanasy Cancer Research Fund and the Israel Cancer 
Association (DHW). DHW, MH and IT were the recipients of 
European Molecular Biology Organization short-term and long-term 
(MH) Fellowships; DHW is the recipient of a Koret Foundation 
Fellowship, California. We thank Professor Pierre Chambon for con- 
tinued support and fruitful discussions. 



REFERENCES 

1. Keydar, I., Chen, L., Karby, S., Weiss, F. R., Delarea, J., Radu, 

M., Chaitchik, S. & Brenner, H. J. (1979) Eur. J. Cancer 15, 
659-670. 

2. Keydar, R., Chou, C. S., Hareuveni, M., Tsarfaty, I., Sahar, E., 

Seltzer, G., Chaitchik, S. & Hizi, A. (1989) Proc. Natl Acad. 
Sci. USA 86, 1362-1366. 



3. Abe, M. & Kufe, D. (1986) J. Cell. Physiol. 126, 126-J3 6 

4. Bramwell, M. E., Bhavanandan, V. P., Wiseman, G. & jjJ 

H. (1983) Br. J. Cancer 48, 177-183. 

5. Burchell, J. M., Durbin, H. & Taylor-Papadimitriou, J. (i9g| 

Immunol. 131 , 508-513. i 

6. Ceriani, R. L., Peterson, J. A. & Blank, E. W. (1984) Cancer R 

44, 3033-3039. : • 

7. Hilkens, J., Buijs, F., Hilgers, J., Hagemann, Ph., Calafat^ 

Sonnenberg, A. & Van der Valk, M. (1984) Int. J. Cancel 
197-206. 

8. Johnson, V. G., Schlom, J., Paterson, A. J., Bennett, J., Magna 

J. L. & Cokher, D. (1986) Cancer Res. 46, 850 - 857. 

9. Lan, M. S., Finn, O. J., Fernsten, P. D. & Metzgar, R. s. (19^ 

Cancer Res. 45, 305 - 310. 

10. Magnani, J. L., Steplewski, Z., Koprowski, H. & Ginsburg 

(1983) Cancer Res. 43, 5489-5492. " 

11. Price, M. R-, Edwards, S., Robins, R. A., Hilgers, J., Hilkeri 

J. & Baldwin, R. (1986) Eur. J. Can. Clin. Oncol 22, 115-11 

12. Burchell, J., Gendler, S., Taylor-Papadimitriou, J., Girling, * 

Lewis, A., Mills, R. & Lamport, D. (1987) Cancer Res. 4 
5476-5482. 

13. Siddiqui, J., Abe, M., Hayes, D., Shani, E., Yunis, E. & Kufe, 

(1988) Proc. Natl Acad. Sci. USA 85, 2320 - 2323. 

14. Gendler, S., Taylor-Papadimitriou, J., Duhig, T., Rothbard, J. 

Burchell, J. (1988) J. Biol. Chem. 263, 12820-12823. 

15. Gendler, S. J., Burchell, J. M., Duhig, T., Lamport, D., White 

R., Parker, M. & Taylor-Papadimitriou, J: (1987) Proc. Na 
Acad. Sci. USA 84, 6060 - 6064. ■ , 

16. Swallow, D. M., Gendler, S., Griffiths, B., Corney, G., Taylor^ 

Papadimitriou, J. & Bramwell, M. E. (1987) Nature 328, 82- 
84. 

16a. Wreschner, D. H., Hareuveni, M., Tsarfaty, I., Smorodmsky, N., 
Horev, J., Zaretsky, J., Kotkes, P., Weiss, M., Lathe, R., Dion, 
A. & Keydar, I. (1990) Eur. J. Biochem. 189, 463-473. 

17. Petkovitch, M., Brand, N. J., Krust, A. & Chambon, P. (1987) 

Nature 330, 445-450. 

1 8. Walter, P., Green, S. } Green, G., Krust, A., Bornert, J. M., Jeltsch, 

J. M., Staub, A., Jensen, E., Scrace, G., Waterfield, M. & 
Chambon, P. (1985) Proc. Natl Acad. Sci. USA 82, 7889- 
7893. 

19. Rigby, P. W. J., Dieckmann, M., Rhodes, C. $ Berg, P. (1977) /. 

Mol. Biol. 113, 237-251. 

20. Wreschner, D. H. & Rechavi, G. (1988) Eur. J. Biochem. 172, 

333-344. 

21. Aviv, H. & Leder, P. (1972) Proc. Natl Acad. Sci. USA69,\m~ 

1412. 

22. Colbere-Garapin, F., Horodniceanu, F., Kounlsky, P. & 

Garapin, A. C. (1981) J. Mol. Biol 150, 1-14. 

23. Matriceau, L. M., Glaichenhaus, R, Gesnel, M.C. & Breathnach, 

R. (1985) EMBO J. 4, 1435-1440. 

24. Wigler, S. F., Pellicer, A., Silverstein, S. & Axel, R. (1978) Celt 

14, 725 -731. 

25. Graham, F. & Van der Eb, A. (1973) Virology 52, 456-457 

26. Sanger, F., Nicklen, S. & Coulson, A. R. (1977) Proc. Natl Acad. 

Sci. USA 74, 5463-5467. 

27. Laemmli, U K. (1970) Nature 227, 680-685. 

28. Timpte, C. S., Eckhardt, A. E., Abernethy, J. L. & Hill, K. l. 

(1988) J. Biol Chem. 263, 1081-1088. 

29. Gun, J. R., Byrd, J. C, Hicks, J. W., Toribara, J. W., Import, 

D T A. & Kim, Y. S. (1989) J. Biol. Chem. 264, 6480 - 648/. 

30. Dougherty, G. J., Kay, R. J. & Humphries, R. K. (1989) J. Biol. 

Chem. 264,6509-6514. 

31. McLean, J. W., Tomlinson, J. E., Kuang, W. J., Eaton D. 

Chen, E. Y., Fless, G. M., Scanu, A. M. & Lawan, R- M. ( W 
Nature 330, 3 32-137. 

32. Sorimachi, M., Emori, Y., Kawasaki, H., Kitajima, D., lnow, 

S., Suzuki, K. & Inoue, Y. (1988) J. Biol. Chem. 263, 176/5 
17684. 

33. Hoffman, W. (1988) J. Biol. Chem. 263, 7686-7690. 

34. Tsarfaty, L, Hareuveni, M., Horev, J., Zaretsky, Weiss, M., 

Jeltsch, J. M., Gamier, J. M., Lathe, R., Keydar, I- * 
Wreschner, D. H. (1990) Gene, in the press. 



Entrez-PubMed 



Page 1 of 2 




Pub 




PubMed Nucteotide Protein Genome 

Search PubMed 



Structure 



PMC 



National 
Library 
of Medicine 

Taxonomy 




OMIM 



Be 



sill for 



Go Clear 



About Entrez 



o 



0 Limits 



Preview/Index 



History 



Clipboard 



Text Version 

Entrez PubMed 
Overview 
Help | FAQ 
Tutorial 

New/Noteworthy 
E-Utilities 

PubMed Services 
Journals Database 
MeSH Database 
Single Citation Matcher 
Batch Citation Matcher 
Clinical Queries 
LinkOut 
Cubby 

Related Resources 
Order Documents 
NLM Gateway 
TOXNET 
Consumer Heaith 
Clinical Alerts 
ClinicarTriais.gov 
PubMed Central 

Privacy Policy 



Display Abstract 



Show: 20 L i Sort 



Send to 



Details 
Text 



□ 1: J Parasitol. 2003 Apr;89(2):381-4. 



Related Articles, Lin 



Upregulation of cardiac cell plasma membrane calcium pump in 
a canine model of Chagas disease. 

Barr SC, Pannabecker TL, Gilmour RF Jr, Chandler JS. 

Department of Clinical Sciences, College of Veterinary Medicine, Cornell 
University, Ithaca, New York 14853, USA. scb6@cornell.edu 

We have previously demonstrated that cardiac myocytes isolated from the 
hearts of adult dogs develop rapid repetitive cytosolic Ca2+ transients, 
membrane depolarization, and cell contraction by mobilization of 
sarcoplasmic reticulum Ca2+ stores when exposed to a soluble factor from tt 
trypomastigotes of Trypanosoma cruzi. These findings led us to investigate 
the regulatory mechanisms of cytosolic Ca2+ in cardiac tissues from dogs 
chronically infected with T. cruzi. Expression of the plasma membrane 
calcium pump (PMCA) RNA and protein was determined by Northern and 
Western blotting, respectively, followed by densitometric analyses.^ 642-bj 
PMCA lb complementary DNA probe derived from canine epicardial tissue 
hybridized to 2 major transcripts (7.3 and 5.3 kb) in canine epicardium. 
Expression of the dominant transcript (7.3 kb) was 77% greater in cardiac 
tissues obtained from dogs with chronic T. cruzi infection (140 days after 
inoculation) in comparison with constitutive expression levels in normal dog; 
Monoclonal antibody 5F10, known to recognize all isoforms of the PMCA, 
was used to detect expression of the PMCA protein in epicardial tissue. 
Expression of a 142-kDa protein was increased by 58% in the cardiac tissues 
of infected dogs when compared with those from uninfected dogs. To 
establish a link between the upregulation of PMCA in dogs chronically 
infected with Chagas disease and the ventricular-based arrhythmias and 
myocardial failure that occur during this stage of disease both in dogs and 
humans, further study will be required. 

PMID: 12760659 [PubMed - indexed for MEDLINE] 
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Deciphering the Message in Protein Sequences: 
Tolerance to Amino Acid Substitutions 

Tames U. Bowie * John F. Reidhaar-Olson, Wendell A. Lim, 
} Robert T. Sauer 



An amino acid sequence encodes a message that deter- 
ges the shape and function of a protein. This message is 
Sly degenerate in that many different sequences can 
for proteins with essentially the same stnutrc; and 
activity. Comparison of different sequences with similar 
11 can Leal key features of the code and improve 
undemanding of hoW a protein folds and how it per- 
forms its function. 



THE GENOME IS MANIFEST LARGELY IN THE SET OF PRO- 
teins that it encodes. It is the ability of these proteins to fold 
into unique three-dimensional structures that allows them to 
function and carry out the instructions of the genome Thus, 
comprehending the rules that relate amino acid sequence to struc- 
ture is fundamental to an understanding of biological processes. 
Because an amino acid sequence contains all of the .informauor. 
necessary to determine the structure of a protein (1) it should be 
possible to predict structure from sequence, and subsequendy o 
Wer detailed aspects of function from the «™^ "j^' 
problems are extremely complex, and ,t seems unhkely tliat either 
will be solved in an exact manner in the near future. It may be 
possible to obtain approximate solutions by using experimental data 
to simplify the problem. In this article, we describe how an analysis 
of allowed amino acid substitutions in proteins can be used to 
reduce the complexity of sequences and reveal important aspects of 
structure and function. 



Methods for Studying Tolerance to 
Sequence Variation 

There are two main approaches to studying the tolerance of an 
amino acid sequence to change. The first method rehes on the 
process of evolution, in which mutations are either accepted or 
rejected by natural selection. This method has been extremely 
powerful for proteins such as the globins or cytochromes for which 
sequences from many different species are known (2-7). The second 
approach uses genetic methods to introduce amino acid changes at 
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specific positions in a cloned gene and uses selections or screens to 
identify functional sequences. This approach has been used to great 
advantage for proteins that can be expressed in bacteria or yeast, 
where the appropriate genetic manipulations are possible (3, 8-11). 
The end results of both methods are lists of active sequences that can 
be compared and analyzed to identify sequence features that are 
essential for folding or function. If a particular property of a side 
chain such as charge or size, is important at a given position, only 
side chains that have the required property will be allowed. Con- 
versely, if the chemical identity of the side chain is unimportant, 
then many different substitutions will be permitted. 

Studies in which these methods were used have revealed that 
proteins are surprisingly tolerant of amino acid substitutions (2-4 
11) For example, in studying the effects of approximately 150G 
single amino acid substitutions at 142 positions in lac repressor, 
Miller and co-workers found that about one-half of all subsntut.ons 
were phenotypically silent (11). At some positions, many different, 
nonconservative substitutions were allowed. Such res.due positions; 
play little or no role in structure and function. At omer rxis.nons „o f 
substitutions or only conservative substitutions were allowed. These f 
residues are the most important for lac repressor activity. > 

What roles do invariant and conserved side chains play in, 
proteins? Residues that are directly involved in protein function* 
such as binding or catalysis will certainly be among the rnostj 
conserved. For example, replacing the Asp in the catalyt.c triad ofl 
trypsin with Asn results in a 10<-fold reduction ,n 
similar loss of activity occurs in X repressor when a DNA bmdm| 
residue is changed from Asn to Asp (13). To c-^ 
function, however, these catalytic residues and b.ndin ; 
must be precisely oriented in three dimensions. <*»*gfj 
mutations in residues that are required for structure forrnat.or or 
stability can also have dramatic effects on actrvity < [W. W 
Hence, many of the residues that are conserved in sets of related 
sequences play structural roles. 



Substitutions at Surface and Buried Positions^ 

In their initial comparisons of the globin sequences, Perutz M 
co-workers found that most buried residues require nonpol . 
chains, whereas few features of surface side cha.ns are gen 
conserved (6). Similar results have been seen for a number at pro ^ 
families (2, I 5, 7, 17. 18). An example of the «^ "££,4 
surface versus buried sites can be seen in Fig. 1, which s ^ 
allowed substitutions in X repressor at res.due positions that , 
the d,mer interface but distant from the DNA binding surface^ 



!^A) Amino acid substitutions allowed in a 
region of A repressor. The wild- type se- 
* is shown along the center line. The al- 
substitutions shown above each position 
'identified by randomly mutating one to 
cbd° nS at a t * mc D V us i n g 3 cassette method 
jpp)ying a functional selection (9). (B) The 
ionai solvent accessibility (42) of the wild- 
side chain in the protein dimer (43) relative 
1 |S§^ S ame atoms in an AJa-X-Ala model tripep- 
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tjjsction after cassette mutagenesis. A histogram of side chain 
j>jyent accessibility in the crystal structure of the dimer is also 
jpwh in Fig. 1. At six positions, only the wild-type residue or 
ively conservative substitutions are allowed. Five of these 
options are buried in the protein. In contrast, most of the highly 
Kposed positions tolerate a wide range of chemically different side 
lains, including hydrophilic and hydrophobic residues. Hence, it 
xms that most of the structural information in this region of the 
rotein is carried by the residues that are solvent inaccessible. 




Constraints on Core Sequences 

(Because core residue positions appear to be extremely important 
^protein folding or stability, we must understand the factors that 
ictate whether a given core sequence will be acceptable. In general, 
ay int njy hydrophobic or neutral residues are tolerated at buried sites in 
oteins, undoubtedly because of the large favorable contribution of 
^Hydrophobic effect to protein stability (19). For example, Fig. 2 
lad of lows the results of genetic studies used to investigate the substitu- 
2)- A| ,c|is allowed at residue positions that form the hydrophobic core of 
riding! NH 2 -terminal domain of k repressor (20). The acceptable core 
mcir j ?quences are composed almost exclusively of Ala, Cys, Thr, Val, He, 
;idue$| |eu; Met, and Phe. The acceptability of many different residues at 
endy* acji core position presumably reflects the fact that the hydrophobic 
on ot ffect, unlike hydrogen bonding, does not depend on specific 
H^ uc Parings- Although it is possible to imagine a hypothetical 
dated ic£e structure that is stabilized exclusively by residues forming 
v lydfogen bonds and salt bridges, such a core would probably be 
" to construct because hydrogen bonds require pairing of 

~R^ors and acceptors in an exact geometry. Thus the repertoire of 
l^ble structures that use a polar core would probably be extreme- 
Jflpited (21). Polar and charged residues are occasionally found in 
cores of proteins, but only at positions where their hydrogen 
^tiding needs can be satisfied (22). 
We cores of most proteins are quite closely packed (23), but some 
°iume changes are acceptable. In X. repressor, the overall core 
°(ume of acceptable sequences can vary by about 10%. Changes at 
^dividual sites, however, can be considerably larger. For example, 
*° ?how n in Fig. 2, both Phe and Ala are allowed at the same core 
'ition in the appropriate sequence contexts. Large volume 
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phylogenetic studies, where it has been noted that the size decreases 
and increases at interacting residues are not necessarily related in a 
simple complementary fashion (5, 7, 17). Rather, local volume 
changes are accommodated by conformational changes in nearby 
side chains and by a variety of backbone movements. 



The Informational Importance of the Core 

With occasional exceptions, the core must remain hydrophobic 
and maintain a reasonable packing density. However, since the core 
is composed of side chains that can assume only a limited number of 
conformations (24), efficient packing must be maintained without 
steric clashes. How important are hydrophobicity, volume, and 
steric complementarity in determining whether a given sequence can 
form an acceptable core? Each factor is essential in a physical sense, 
as a stable core is probably unable to tolerate unsatisfied hydrogen 
bonding groups, large holes, or steric overlaps (25). However, in an 
informational sense, these factors are not equivalent. For example, in 
experiments in which three core residues of X repressor were 
mutated simultaneously, volume was a relatively unimportant infor- 
mational constraint because three-quarters of all possible combina- 
tions of the 20 naturally occurring amino acids had volumes within 
the range tolerated in the core, and yet most of these sequences were 
unacceptable (20). In contrast, of the sequences that contained only 



Fig. 2. Amino acid substitu- 
tions allowed in the core of X 
repressor. The wild-type side 
chains are shown pictorially in 
the approximate orientation 
seen in the crystal structure 
(43). The lists of allowed sub- 
stitutions at each position are 
shown below the wild-type 
side chains. These substitu- 
tions were identified by ran- 
domly mutating one to four 
residues at a time by using a 
cassette method and applying 
a functional selection (20). 
Not all substitutions are al- 
lowed in every sequence back- 
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'the appropriate hydrophobic re$idue r ^^ignihcant fraction were 
acceptable. Hence, the hydrophobic^^>f a sequence contains 
more information about its potential acceptability in the core than 
does the total side chain volume. Steric compatibility was intermedi- 
ate between volume and hydrophobicity in informational impor- 
tance. 



The Informational Importance of Surface Sites 

We have noted that many surface sites can tolerate a wide variety 
of side chains, including hydrophilic and hydrophobic residues. This 
result might be taken to indicate that surface positions contain little 
structural information. However, Bashford et a/., in an extensive 
analysis of globin sequences (4), found a strong bias against large 
hydrophobic residues at many surface positions. At one level, this 
may reflect constraints imposed by protein solubility, because large 
patches of hydrophobic surface residues would presumably lead to 
aggregation. At a more fundamental level, protein folding requires a 
partitioning between surface and buried positions. Consequently, to 
achieve a unique native state without significant competition from 
other conformations, it may be important that some sites have a 
decided preference for exterior rather than interior positions. As a 
result, many surface sites can accept hydrophobic residues individ- 
ually, but the surface as a whole can probably tolerate only a 
moderate number of hydrophobic side chains. 



Identification of Residue Roles from 
Sets of Sequences 

Often, a protein of interest is a member of a family of related 
sequences. What can we infer from the pattern of allowed substitu- 
tions at positions in sets of aligned sequences generated by genetic 
or phylogenetic methods? Residue positions that can accept a 
number of different side chains, including charged and highly polar 
residues, are almost certain to be on the protein surface. Residue 
positions that remain hydrophobic, whether variable or not, are 
likely to be buried within the structure. In Fig. 3, those residue 
positions in \ repressor that can accept hydrophilic side chains are 
shown in orange and those that cannot accept hydrophilic side 
chains are shown in green. The obligate hydrophobic positions 
define the core of the structure, whereas positions that can accept 
hydrophilic side chains define the surface. 

Functionally important residues should be conserved in sets of 
active sequences, but it is not possible to decide whether a side chain 
is functionally or structurally important just because it is invariant or 
conserved. To make this distinction requires an independent assay of 
ptotein folding. The ability of a mutant protein to maintain a stably 
folded structure can often be measured by biophysical techniques, 
by susceptibility to intracellular proteolysis (26) , or by binding to 
antibodies specific for the native structure (27, 28). In the latter 
cases, it is possible to screen proteins in mutated clones for the 
ability to fold even if these proteins are inactive. Sets of sequences 
that allow formation of a stable structure can then be compared to 
the sets that allow both folding and function, with the active site or 
binding residues being those that are variable in the set of stable 
proteins but invariant in the set of fiinctionai proteins. The DNA- 
binding residues of Arc repressor were identified by this method (8). 
The receptor-binding residues of human growth hormone were also 
identified by comparing the stabilities and activities of a set of 
mutant sequences (28), However, in this case, the mutants were 
generated as hybrid sequences between growth hormone and related 
hormones with different binding specificities. 



Implications ^^tructure Prediction 

At present, the only reliable method for predicting a 
resolution tertiary structure of a new protein is by identif 
sequence similarity to a protein whose structure is already kncr 
(29, 30). However, it is often difficult to align sequences as the \c\ 
of sequence similarity decreases, and it is sometimes impossible 
detect statistically significant sequence similarity between distan 
related proteins. Because the number of known sequences is f 
greater than the number of known structures, it would be advant^ 
geous to increase the reach of the available structural information h ' 
improving methods for detecting distant sequence relations and for 
subsequently aligning these sequences based on structural principles 
In a normal homology search, the sequence database is scanned with 
a single test sequence, and every residue must be weighted equal] 
However, some residues are more important than others and should 
be weighted accordingly. Moreover, certain regions of the protein 
are more likely to contain gaps than others. Both kinds of informa- 
tion can be obtained from sequence sets, and several techniques have 



Fig. 3. Tolerance of positions in the NH 2 -terminal domain of X repressor Kj 
hydrophilic side chains. The complex (43) of the repressor dimer (blue) and 
operator DNA (white) is shown. In (A), positions that can tolerate , 
hydrophilic side chains are shown in orange. The same side chains are sho*n j 
in (B) without the remaining protein atoms. In (C), positions that requite ; 
hydrophobic or neutral side chains are shown in green. These side chains arc 
shown in (D) without the remaining protein atoms. About three-fourths 
the 92 side chains in the NH 2 -terminal domain are included in both (B) & I 
(D). The remaining positions have nor been tested. Data are from {9, 14, * 
27, 44). 
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"^uscd to combine such informatioi^^» more appropriately 
f|hted sequence searches and alignme^re (37). These methods 
^ used to align the sequences of retroviral proteases with aspartic 
leases, which in rum allowed construction of a three-dimension- 
f|jt>del for the protease of human immunodeficiency virus type 1 
| Comparison with the recently determined crystal structure of 
protein revealed reasonable agreement in many areas of the 
icted structure (32). 

structural information at most surface sites is highly degener- 
Except for functionally important residues, exterior positions 
fenr to i rn P ortant chiefly in maintaining a reasonably polar 
: ace. The information contained in buried residues is also 
enerate, the main requirement being that these residues remain 
|drophobic. Thus, at its most basic level, the key structural 
Issage in an amino acid sequence may reside in its specific pattern 
hydrophobic and hydrophilic residues. This is meant in an 
fcrmational sense. Clearly, the precise structure and stability of a 
>tein depends on a large number of detailed interactions. It is 
ible, however, that structural prediction at a more primitive 
el can be accomplished by concentrating on the most basic 
rmational aspects of an amino acid sequence. For example, 
iphipathic patterns can be extracted from aligned sets of sequences 
d used, in some cases, to identify secondary structures, 
[f a region of secondary structure is packed against the hydropho- 
core, a pattern of hydrophobic residues reflecting the periodicity 
secondary structure is expected (33, 34). These patterns can be 
ired in individual sequences by hydrophobic residues on the 
itein surface. It is rare, however, for a surface position to remain 
ophobic over the course of evolution. Consequently, the am- 
•athic patterns expected for simple secondary structures can be 
clearer in a set of related sequences (6). This principle is 
trated in Fig. 4, which shows helical hydrophobic moment plots 
>j|the Antennapedia homeodomain sequence (Fig. 4A) and for a 
jrhposite sequence derived from a set of homologous homeodo- 
iSn proteins (Fig. 4B) (35). The hydrophobic moment is a simple 
ure of the degree of amphipathic character of a sequence in a 
i|£n secondary structure (34). The amphipathic character of the 
fee ot-helical regions in the Antennapedia protein (36) is clearly 
^aled only by the analysis of the combined set of homeodomain 
uences. The secondary structure of Arc repressor, a small DNA- 
ig protein, was recently predicted by a similar method (8) and 
rmed by nuclear magnetic resonance studies (37). 
specific pattern of hydrophobic and hydrophilic residues in 
no acid sequence must limit the number of different structures 
fven sequence can adopt and may indeed define its overall fold. If 
j is true, then the arrangement of hydrophobic and hydrophilic 
fdues should be a characteristic feature of a particular fold. Sweet 
J Eisenberg have shown that the correlation of the pattern of 
|jrophobicity between two protein sequences is a good criterion 
their structural relatedness (38). In addition, several studies 
cate that patterns of obligatory hydrophobic positions identified 
aligned sequences are distinctive features of sequences that 
the same structure (4, 29, 38, 39). Thus, the order of 
^phobic and hydrophilic residues in a sequence may actually be 
Ipient information to determine the basic folding pattern of a 
Wtein sequence. 

;h the pattern of sequence hydrophobicity may be a 
fracteristic feature of a particular fold, it is not yet clear how such 
perns could be used for prediction of structure de novo. It is 
portant to understand how patterns in sequence space can be 
e d to structures in conformation space. Lau and Dill have 
jPproached this problem by studying the properties of simple 
"uences composed only of H (hydrophobic) and P (polar) groups 
J two-dimensional lattices (40). An example of such a representa- 
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tion is shown in Fig^^^Plesidues adjacent in the sequence must 
occupy adjacent squares on the lattice, and two residues cannot 
occupy the same space. Free energies of particular conformations are 
evaluated with a single term, an attraction of H groups. By 
considering chains of ten residues, an exhaustive conformational 
search for all 1024 possible sequences of H and P residues was 
possible. For longer sequences only a representative fraction of the 
allowed sequence or conformation space could be explored. The 
significant results were as follows: (i) not all sequences can fold into 
a "native" structure and only a few sequences form a unique native 
structure; (ii) the probability that a sequence will adopt a unique 
native structure increases with chain length; and (iii) the native 
states are compact, contain a hydrophobic core surrounded by polar 
residues, and contain significant secondary structure. Although the 
gap between these two-dimensional simulations and three-dimen- 
sional structures is large, the use of simple rules and sequence 
representations yields results similar to those expected for real 
proteins. Three-dimensional lattice methods are also beginning to 
be developed and evaluated (41). 



Summary 

There is more information in a set of related sequences than in a 
single sequence. A number of practical applications arise from an 
analysis of the tolerance of residue positions to change. First, such 
information permits the evaluation of a residue's importance to the 
function and stability of a protein. This ability to identify the 
essential elements of a protein sequence may improve our under- 
standing of the determinants of protein folding and stability as well 
as protein function. Second, patterns of tolerance to amino acid 
substitutions of varying hydrophilicity can help to identify residues 
likely to be buried in a protein structure and those likely to occupy 



Fig. 4. Helical hydro- 
phobic moments calcu- 
lated by using (A) the 
Antennapedia homeodo- 
main sequence or (B) a 
set of 39 aligned homeo- 
domain sequences (35). 
The bars indicate the ex- 
tent of the helical re- 
gions identified in nucle- 
ar magnetic resonance 
studies of the Antenna- 
pedia homeodomain 
(36). To determine hy- 
drophobic moments, 
residues were assigned 
to one of three groups: 
HI (high hydrophobici- 
ty = Trp, He, Phe, Leu, 
Met, Val, or Cys); H2 
(medium hydrophobic- 
ity = Tyr, Pro, Ala, Thr, 
His, Gly, or Ser); and H3 (low hydrophobicity = Gin, Asn, Glu, Asp, Lys, 
or Arg). For the aligned homeodomain sequences, the residues at each 
position were sorted by their hydrophobicity by using the scale of Fauchere 
and Pliska (45). Arg and Lys were not counted unless no other residue was 
found at the position, because they contain long aliphatic side chains and can 
thereby substitute for nonpolar residues at some buried sites. To account for 
possible sequence errors and rare exceptions, the most hydrophilic residue 
allowed at each position was discarded unless it was observed twice. The 
second most hydrophilic residue was then chosen to represent the hydropho- 
bicity of each position. An eight-residue window was used and the vectors 
projected radially every 100°. The vector magnitudes were assigned a value of 
1 , 0, or - 1 for positions where the hydrophobicity group was H 1 , H2, or 
H3, respectively. 
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_\. 5. A representation of one com- 
pact conformation for a particular 
sequence of H and P residues on a 
two-dimensional square lattice. 
[Adapted from (40), with permis- 
sion of the American Chemical Soci- 
ety] 
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surface positions. The amphipathic patterns that emerge can be used 
to identify probable regions of secondary structure. Third, incorpo- 
rating a knowledge of allowed substitutions can improve the ability 
to detect and align distantly related proteins because the essential 
residues can be given prominence in the alignment scoring. 

As more sequences are determined, it becomes increasingly likely 
that a protein of interest is a member of a family of related 
sequences. If this is not the case, it is now possible to use genetic 
methods to generate lists of allowed amino acid substitutions. 
Consequently, at least in the short term, it may not be necessary to 
solve the folding problem for individual protein sequences. Instead, 
information from sequence sets could be used. Perhaps by simplify- 
ing sequence space through the identification of key residues, and by 
simplifying conformation space as in the lattice methods, it will be 
possible to develop algorithms to generate a limited number of trial 
structures. These trial structures could then, in turn, be evaluated by 
further experiments and more sophisticated energy calculations. 
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The structure, stability, and unfolding-refolding kinetics of Escherichia colt- 
expressed recombinant goat oc-lactalbumin were studied by circular 
SHL spectroscopy 8 X-ray crystallography anc sapped -flow 
measurements, and the results were compared with those of the , autn 
"rotein prepared from goat milk. The electric properties of Ae t*o 
nroteins were also studied by gel electrophoresis and >™- e f han S e chro- 
matography. Although the overall structures of the authentic and recom- 
binant proteins are the same, the extra methionine residue at the N 
taXus of the recombinant protein remarkably affects the native-state 
stability and the electric properties. The native state of the recombinant 
protein was 3.5 kcal/mol less stable than the authentic protein, and the 
recombinant protein was more negatively charged than the au hen ic 
one The recombinant protein unfolded 5.7 times faster than the authentic 
one Sough there were no significant differences in the refo dmg rates 
of the two proteins. The destabilization of the recombinant protein can be 
fully interpreted in terms of the increased unfolding rate of the _prote>n, 
indicating that the N-terminal region remains unorganized in the tran- 
sition state of refolding, and hence is not involved in the folding 
Son Ste of the protein. A comparison of the X-ray structures of 
Scombman a lactalbLin determined here with that of the authentic 
protein shows that the structural differences between the proteins are 
confined to the N-terminal region. Theoretical considerations for he 
Sfferences in the conformational and solvation free energies between the 
oroteins show that the destabilization of the recombinant protein is pri- 
marily due to excess conformational entropy of the N-terminal methion- 
me residue in the unfolded state, and also due to less exposure of 
hydrophobic surface on unfolding. The results suggest that when the N- 
terminal region of a protein has a rigid structure, expression of he pro- 
teTby £- coli, which adds the extra methionine residue, destabilizes the 
native state through a conformational entropy effect. It also shows that 
Sifferences in the Electrostatic interactions of the N-termmal amino group 
withTe side-chain atoms of Thr38, Asp37 and Asp83 bring about a 
difference in the pK, value of the N-terminal ammo group between the 
proteTns: resulting^ a greater negative net charge of the recombinant 
protein at neutral pH. & ^ press 

Keywords: recombinant goat a-lactalbumin; extra N-terminal methionine 
residue; protein folding; X-ray crystallography study; conformational 
entropy 
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Introduction 

The N -terminal sequence of a recombinant pro- 
tein expressed in Escherichia coli is known to start 
with formyl-methionine (Marcker & Sanger, 1964), 
which is in most cases subsequently processed by 
deform ylase enzyme (Adams, 1968; Takeda & 
Webster, 1968), and removed by methionine ami- 
nopeptidase to finally produce the N-terminal 
methionine-free recombinant protein. However, 
removal of the N-terminal methionine does not 
always take place, and about half of E. coli- 
expressed proteins contain the extra N-terminal 
methionine residue, because the aminopeptidase 
action depends on the nature of the penultimate 
amino acid residue (Moerschell et aL, 1990). There- 
fore, the effect of the N-terminal methionine resi- 
due, when present, on the structure, stability and 
folding of E. co/f-expressed recombinant proteins 
should be an important issue in biophysical and 
molecular biological studies that use such recombi- 
nant proteins, although this has not been taken 
seriously in most cases. 

The biological and physicochemical properties of 
the methionylated proteins expressed in E. coli 
may differ from those of the authentic proteins 
that do not have the N-terminal methionine. For 
example, recombinant hen egg-white lysozyme 
contains the N-terminal methionine residue (Miki 
et aL, 1987; Mine et aL, 1997) and has lower solubi- 
lity and stability than the authentic form (Imoto 
et aL, 1987). Similarly, recombinant apomyoglobin 
expressed in E. coli contains the extra N-terminal 
methionine residue and is less stable than the 
authentic protein (Hargrove et aL, 1994), On the 
other hand, the presence of the extra N-terminal 
methionine or the extension or truncation of the 
N-terminal residues does not interfere with the 
native-state stability in certain other globular 
proteins (Kordel et aL, 1989; Duverger et aL, 1991). 
In recombinant ribonuclease A, the extra N-term- 
inal methionine is even known to stabilize the 
native structure (Schultz & Baldwin, 1992; 
Aronsson et aL, 1995). However, details of the 
effects of the extra N-terminal methionine residue 
on the structure, stability, and folding of the pro- 
teins have not yet been well understood. 

a-Lactaibumin is a milk Ca 2+ -binding protein, 
which consists of 123 amino acid residues and has 
a molecular mass of 14,200 Da. The three-dimen- 
sional structure of a-lactalburnin from several 
mammalian species, including goat, cow, guinea 
pig, and human, has been determined by X-ray 
crystallographic analysis (Acharya et aL, 1991; Pike 
et aL, 1996), and it is very similar to the structure 
of c-type lysozyme, a homologous protein. a-Lac- 
talbumin has been used actively as a model protein 
in studies of protein folding (Sugai & Ikeguchi, 
1994; Kuwajima, 1989, 1996; Vanderheeren & 
Hanssens, 1994; Uchiyama et aL, 1995; Schulman & 
Kim, 1996; Arai & Kuwajima, 1996; Schulman et aL, 
1997; Wilson et aL, 1996; Shimizu et aL, 1996; 
Balbach et aL, 1996; Katsumata et aL, 1996; Kataoka 



et aL, 1997; ivuhlman et aL, 1997; Wu & Ki m lgQ7 
Pfeil, 1998; Ikeguchi et aL, 1998), because th 
protein readily adopts a molten globule stat S 
which is known to be identical with a folding i m C ' 
mediate (Kuwajima, 1989, 1996; Ptitsyn, 19%) 
under a variety of conditions, including those at 
low pH, at a moderate concentration of guanidin 3 
hydrochloride (GdnHCl), and in the absence 0 f 
Ca 2+ and other salts (Kuwajima, 1989, 19%) 
Recombinant a-lactalbumin expressed in' E. coli 
though containing the extra N-terminal methion' 
ine, has often been used in these studies of protein 
folding. A recent study has, however, shown that 
like recombinant hen egg-white lysozyme, recom- 
binant bovine a-lactalbumin is less stable than the 
authentic protein, although the lactose synthase 
regulatory activities of the recombinant and auth- 
entic proteins have been shown to be identical 
with each other (Ishikawa et aL, 1998). 

Here, we show that E. co/i-expressed recombi- 
nant goat a-lactalbumin is destabilized by the 
presence of the extra N-terminal methionine resi- 
due by as much as 3.5 kcal/mol and has a more 
negative electric net charge than the authentic pro- 
tein. It is concluded that the destabilization of the 
recombinant protein is primarily brought about by 
an extra conformational entropy of the methionyl 
residue in the unfolded state and that the more 
negative charge of the recombinant protein is 
caused by a decrease in the pK n value of the 
N-terminal amino group. Because the N-terminal 
methionine remarkably destabilizes recombinant 
a-lactalbumin, the role of the N terminus in the 
folding of this protein has also been investigated 
by stopped-flow circular dichroism (CD) studies of 
the unfolding and refolding kinetics of the recom- 
binant and authentic proteins. The destabilization 
of the recombinant protein is shown to be entirely 
interpreted in terms of an increase in the unfolding 
rate, indicating that the N terminus is not involved 
in the folding initiation site of a-lactalbumin. 

Results 

Structure of folded recombinant goat 
a-lactalbumin 

The recombinant wild-type protein was 
expressed in E. coli as inclusion bodies with a high 
yield (15 mg per litre of culture). The protein was 
soiubilized in 8 M urea and refolded in a redox 
buffer in the absence of urea at pH 8.5 and 4°C 
The process of refolding was monitored by 
reversed-phase HPLC (Uchiyama et aL, 1995), and 
the folded protein was purified (see Materials and 
Methods). The peptide and aromatic CD spectra of 
the recombinant protein were measured under 
native conditions (0 M GdnHCl (pH 8.0) at 25 °C), 
and compared with those of authentic goat oc-lac- 
talbumin (Figure 1(a) and (b)). There is no signifi- 
cant difference in the CD spectra between the 
proteins in the aromatic and peptide regions, so 
that the secondary and tertiary structures of the 
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Figure 1. (a) Far and (b) near-UV CD spectra of auth- 
entic and recombinant goat oc-lactalbumin measured in 
the presence of 1 mM CaCl 2 at pH 7.0 and 25 °C. The 
continuous line denotes the authentic protein and the 
filled squares denote the recombinant protein- 



two proteins are essentially identical with each 
other. This conclusion is confirmed by the X-ray 
crystal! ographic structure of recombinant goat 
a-lactalbumin (see below). The results thus indicate 
that the folded recombinant protein is correctly 
folded into the native structure. A study has also 
shown that the lactose synthase regulatory activity 
of the folded recombinant protein is the same as 
that of authentic a-lactalbumin (Uchiyama et a\. f 
1995). 

Equilibrium unfolding 

The GdnHCl-induced equilibrium unfolding 
transition of the folded recombinant protein was 
studied by the peptide and aromatic CD spectra, 
arid the results were compared with those of auth- 
entic goat a-Iactalbumin. Figure 2 shows the 
Unfolding transition curves of the two proteins 
Measured by the CD ellipticities, at 222 and 
270 nm, and these ellipticities are expressed by the 
annpmnt fr^rfmnal extent (F^) of unfolding as a 
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Figure 2. GdnHCl-induced unfolding transition curves 
for authentic, recombinant, and methionine- free goat 
a-lactalbumin. The unfolding was carried out at 25 °C in 
the presence of 1 mM CaCl 2/ 50 mM NaCl, and 50 mM 
sodium cacodylate (pH 7.0), and the transitions were 
monitored by both far and near-UV CD measurements- 
Apparent fractions of unfolded species (f a£ n) were 
plotted against the concentration of GdnHCl. Open 
black circles and open red dries denote the F app values 
measured at 222 nm for authentic and recombinant 
proteins, respectively; open black triangles and open red 
triangles represent the F app values measured at 270 nm 
for authentic and recombinant proteins, respectively; 
and the F app values of the methionine-free recombinant 
protein measured at 222 nm are presented by filled blue 
diamonds. 



From Figure 2, the unfolding transition curves 
measured at 222 and 270 nm are coincident with 
each other in authentic and recombinant a-lactal- 
bumin, indicating that the unfolding transitions of 
the two proteins are well represented by a two- 
state mechanism, in which only the native (N) and 
the fully unfolded (U) states are populated in the 
transition zone as: 

N-U (1) 

Here K v is the equilibrium constant of unfolding 
and relates to the free energy change, AG Uy of the 
unfolding transition as: 

Ku^expt-AGu/RT) (2) 

where R and T are the gas constant and the absol- 
ute temperature, respectively, and AG y is assumed 
to be linearly dependent on GdnHCl concentration 
(C) as: 

AGy — AGy 2 ° - mC = m(C m - C) (3) 

where AGy 3 ° is the AG V in the absence of the 
denaturant, C m is the C at the midpoint of the 
unfolding transition, and m represents the depen- 
dence of AGy on C and is a measure of the coop- 
era tivity of the transition (Pace, 1986). From 



Table 1. Equilibrium u ,g transition parameters of goat a-Iactalbui 



AG^° m (kcal/ AAC[j 2 ° AAGu (kcal/m ol ) 

Name of protein (kcal/mol) mol M) C m (M) (kcal/mol) at 3.2 M GdnHQ 

Authentic goat a-lactalbumin 13.8 ± 0.7 4.4 ± 0.2 3.15 ± 0.01 

Recombinant goat a-lactalbumin 10.4 ± 0.5 3.9 ± 0.2 2.67 ± 0.01 -33 ~L9 



by F app is given as a function of C as: 

r , n exp[-m(C m -Q/Rr| 
app * ' " ! + exp[~m(C m - C)/Rr| V ' 

The values of m, C m , and hence AG" 2 °, for recom- 
binant and authentic a-lactalbumin were calculated 
from the data of Figure 2 by the non-linear least- 
squares method. The unfolding parameters m, C m , 
and hence AG" 20 , thus obtained are summarized 
in Table 1. The continuous lines in Figure 2 are the 
curves theoretically drawn with the parameter 
values of Table 1, and show excellent agreement 
between theory and the experimental data. 

Figure 2 also shows that the unfolding transition 
of the recombinant protein occurs at a remarkably 
lower concentration of GdnHCl (C m = 2.7 M) than 
the transition of authentic a-lactalbumin 
(C m = 3.2M). The difference in AGy (AAGu) is 
-3.5 kcal/mol at 0 M GdnHCl and -1.9 kcal/mol 
at 3.2 M GdnHCl, which is the C m for the authentic 
protein (Table 1). Therefore, the folded recombi- 
nant protein is remarkably less stable than auth- 
entic a-lactalbumin, although their native 
structures are practically identical as evidenced by 
the CD spectra and X-ray structural analysis. 



Gel electrophoresis and 
ion-exchange chromatography 

In order to investigate further differences 
between recombinant and authentic a~lactalbumin 
the electrophoretic and ion-exchange chromato^ 
graphic behavior of the two proteins were investi- 
gated. Figure 3(a) shows electrophoretic patterns in 
a non- denaturing polyacrylamide gel at pH 9.4. It 
can be seen that the electrophoretic mobility of the 
recombinant protein is significantly greater than 
that of the authentic protein. Figure 3(b) shows the 
elution profiles of recombinant and authentic a-lac- 
talbumin in an anion-exchange HPLC using a 
RESOURCE™ Q column (Pharmacia Biotech) with 
a' linear gradient from 0 M to 0.5 M NaCl in the 
presence of 10 mM NaH 2 P0 4 -Na 2 HP0 4 buffer 
(pH 7.0). The retention time is longer for the 
recombinant protein (22.9 minutes) than for the 
authentic one (19.6 minutes). Both of these results 
indicate that the recombinant protein is more nega- 
tively charged. These differences in the electric 
properties of the two proteins, however, disappear 
in the U state in 8 M urea. The electrophoretic 
mobilities and the chromatographic retention times 
of the proteins were found to be identical in the 
presence of 8 M urea (data not shown). Therefore, 
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Figure 3. (a) Non-denaturing polyacrylamide gel electrophoresis of authentic and recombinant goat a-lactalbumin. 
The electrophoresis was carried out at pH 8.9 using 12% (w/v) acrylamide gel at room temperature. About 5 jig pro- 
tein samples were applied in each lane. Lanes 1 and 2 contain authentic protein, lanes 3 and 4 contain recombinant 
protein, and lanes 5 and 6 contain an equimolecular mixture of authentic and recombinant protein. Lanes are num- 
bered from left to right in (a) and (c). (b) Superimposed HPLC pattern of authentic and recombinant goat a-lactalbu- 
min. The HPLC was performed with a Resource-Q anion exchange column at pH 7.0 using a linear gradient of 0 M- 
0.5 M NaCl containing 10 mM NaH 2 P0 4 -Na 2 HP0 4 buffer. A 50 ul protein sample containing 50 and 60 Mg of nat,ve 
and recombinant protein, respectively, was applied in the HPLC column, (c) SDS/polyacrylamide gel clecrophoresis 
of authentic and recombinant goat a-lactalbumin. The electrophoresis was carried out using 15% (w/v) acrylarm e 
gel at room temperature. Approximately 10 ug of protein was applied in each lane. Lane 1 contains low molecu a 
mass marker proteins, lane 2 is blank, lanes 3 and 4 contain the authentic protein, lanes 5 and 6 contain the recom i- 
nant protein and lanes 7 and 8 contain an equimolecular mixture of the < uthentic and recombinant proteins. 
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difference in the electric charge between the 
proteins must be caused by the structural folding 
of the proteins into the native structure. 

SDS/polyacrylamide gel electrophoresis was 
a j s o carried out for the recombinant and auth- 
entic proteins using 15% acrylamide in the resol- 
ving (Figure 3(c)). The electrophoretic 
mobilities of the two proteins are the same 
w ithin the experimental error, indicating that 
there is no significant difference in the molecular 
inass between the proteins. 

N-terminal sequence and mass 
spectrometry analyses 

In order to identify any differences in the amino 
acid sequence, we performed N-terminal sequen- 
cing and mass spectrometric analysis of the recom- 
binant and authentic proteins. The N-terminal 
sequences of the first five residues of the two pro- 
teins have shown that recombinant ot-lactalbumin 
contains an additional methionine residue. The 
results of the mass spectrometric analysis indicate 
that the difference in mass between the recombi- 
nant and authentic proteins is 133 (Figure 4), 
which is nearly equal to the mass of a single meth- 
ionine residue (131.19), confirming the presence of 
the extra methionine residue in the recombinant 
protein. Therefore, the only chemical difference 
that brings about the difference in the electric 
charge between the two proteins in the N state is 
the presence or absence of the extra methionine 
residue at the N terminus, and this difference may 
also lead to the remarkable difference in stability 
between the proteins. 

Methionine-free recombinant a-lactalbumin 

In order to directly investigate the effect of the 
extra methionine residue on the electric properties 
and stability of the recombinant protein, methion- 
ine-free recombinant a-lactalbumin was prepared 
by cyanogen bromide (CNBr) cleavage. Because 





Figure 4. MALDI-TOF-MS mass spectroscopic pattern 
of authentic and recombinant goat a-lactalbumin. The 
upper trace is for recombinant and the lower one is for 
authentic protein. 



there is no methionine residue in authentic goat 
a-lactalbumin, only the extra N-terminal methion- 
ine of the recombinant protein is expected to be 
removed by the CNBr cleavage. The removal of 
the methionine was confirmed by N-terminal 
sequencing and mass spectrometric analysis (data 
not shown). The absence of other cleavage pro- 
ducts was confirmed by SDS/polyacrylamide gel 
electrophoresis. The near and far-UV CD spectra of 
the methionine-free recombinant protein overlap 
with those of the authentic and original recombi- 
nant proteins (data not shown). The electrophoretic 
mobility in the native gel and the retention time 
for the anion-exchange chromatography were 
found to be identical with those of the authentic 
protein (data not shown). The stability of the 
methionine-free recombinant protein against the 
GdnHCl-induced unfolding was investigated, and 
the equilibrium unfolding transition of the meth- 
ionine-free protein is shown in Figure 2. The 
unfolding transition curve coincides well with that 
of the authentic protein, and gives the same C m 
and AG" 20 values. As a control, the authentic 
protein was also subjected to the conditions of 
CNBr cleavage, and it was confirmed that the 
unfolding transition of the protein was not affected 
by the cleavage conditions (data not shown). These 
results thus clearly indicate that the observed 
destabilization and the difference in the electric 
charge of the recombinant protein is solely due to 
the presence of the extra N-terminal methionine 
residue. 



Kinetics of refolding and unfolding 

The above results indicate that the presence of 
the extra methionine residue at the N terminus of 
the recombinant protein decreases the relative stab- 
ility of the N state by as much as 3.5 kcal/mol. 
Thus, it appears that both recombinant and auth- 
entic goat a-lactalbumin are useful for investi- 
gating the role of the N-terminal residue in the 
kinetic folding of a-lactalbumin. The kinetic 
unfolding and refolding reactions of the recombi- 
nant and authentic proteins were investigated by 
stopped-flow CD measurements. The unfolding 
and refolding reactions were induced by concen- 
tration jumps of GdnHCl from 1.0 to 5.4 M and 
from 5.5 to 0.5 M, respectively. The reactions were 
monitored by the ellipticity change at 225 nm at 
pH 7.0 and 25 °C. The kinetic progress curves for 
unfolding and refolding are shown in Figure 5(a) 
and (b), respectively, and the data were fitted by 
the non-linear least-squares method with the 
equation: 

,4(0 = A(oo) + A/4 obs a < exp(-*/0 ( 5 ) 

where A(t) and i4(oo) are the observed values of 
the ellipticity at time t and infinite time, respect- 
ively, Ay4 obs is the observed total amplitude 
[A(0) - A(oo)] t and fc f - and a, are the apparent first- 
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Figure 5. GdnHCbinduced (a) unfolding and (b) 
refolding kinetic progress curves of authentic and 
recombinant goat a-lactalbumin. The unfolding was 
initiated by a concentration jump from 1.0 M to 5.4 M 
and the refolding process was initiated by a concen- 
tration jump of 5.5 M to 0.5 M at 25°C in the presence 
of 1 mM CaCl 2 , 50 mM NaCl, and 50 mM sodium caco- 
dylate, pH 7,0, and the refolding and unfolding kinetics 
were monitored by the measurement of CD ellipticity at 
225 nm using stopped-flow CD. The continuous line 
denotes authentic protein and the filled squares denote 
recombinant protein, (b) The inset shows the refolding 
progress curve within two seconds and the same nota- 
tions are used for the transition curves. Theoretical 
kinetic progress curves are also shown in (a) and (b). 



order rate constant and fractional amplitude, 
respectively, of the ith kinetic phase. 

The kinetic progress curves for unfolding for 
both the recombinant and authentic proteins were 
well fitted to a single-exponential equation, and 
the apparent rate constants and the amplitudes for 
the two proteins are presented in Table 2. The 
kinetic progress curves for refolding were well 



Table 2. 

talbumin 



unfolding parameters of g 0a t or.q ac 



Name of protein 



*i (s *) (deg cm ? dmol^i) 



Authentic goat 
a-lactalbumin 

Recombinant goat 
a-lactalbumin 



1.26 ±0.01 
7.18 ±0.08 



-8384 
-8056 



fitted to the three-exponential equation, and the 
rate constants and the amplitudes are presented in 
Table 3. The unfolding reaction of recombinant 
a-lactalbumin is 5.7-times faster than that of the 
authentic protein, while there are no significant 
differences in the rate constants for the triphasic 
refolding reactions of the two proteins. Thus, it 
appears that the N-terrninai end of goat a-lactalbu- 
min is not essential for the kinetic folding of this 
protein (see Discussion). 

X-ray crystaflographic study 

In order to further investigate the differences in 
the folded structure between recombinant and 
authentic goat a-lactalbumin, an X-ray crystallo 
graphic analysis of the recombinant protein was 
performed, and the structure was compared with 
that reported for the authentic protein structure. 
The crystallographic data are summarized in 
Table 4. The space group of the crystal of the 
recombinant protein was altered to P2 1 2 1 2 from 
P2 } in which the authentic protein was packed 
(Pike et a\., 1996). The number of protein molecules 
in the asymmetric unit was one, although there 
were two (Mol A and Mol B) in the authentic pro- 
tein crystal. The final R and free R factors were 
0.191 and 0.278 in the resolution range of 8.0 to 
2.0 A. The overall error was estimated at 0.19 A by 
a Luzzati plot (Luzzati, 1952). As the space group 
is altered in the recombinant protein crystal, the 
N-terminal methionine may affect the molecular 
packing in the crystal. However, the interactions 
between the two independent authentic molecules 
(Mol A and Mol B) were found to be very similar 
to the interactions between the symmetry-related 
recombinant molecules (Figure 6). 

The structural differences between the recombi- 
nant and the authentic proteins are shown in 
Figure 7, which represents the distances between 
the C* atoms of the two molecules. The root-mean- 
square deviations of the main-chain atoms are 
0.55 A between the recombinant protein molecule 



Table 3. Kinetic refolding parameters of goat a-lactalbumin 



Name of protein 




(deg cm 2 dmol x ) 


k 2 (s-') 


(deg cm 2 dmol" 1 ) 


*3 (s- 1 ) 


(deg cm 2 dmol~ ) 


Authentic goat 
a-lactalbumin 

Recombinant goat 
a-lactalbumin 


0.11 ±0.05 
0.09 ± 0.04 


64.28 
70.8 


1.3 ±1.1 
1.3 ±0.4 


145 

335 


4.9 ± 0.3 
5.7 ± 0.4 


22S2 
2234 
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a kle 4. Crystallization, data collection, and refinement 
Statistics of recombinant goat ot-iactalbumin 

;f£- — : " 

Crystallization 
X^servoir solution 



Protein concentration 
temperature (°C) 

B. Crystal data 
Space group 

a, b, c (A) 

an asymmetnc unit 
X-ray generator 

Resolution at measurements (A) 
Total number of ind. refl. 

Completeness ( /o) 

C. Structure determination 
Method 

Model structure 
Software 

D. Refinement 
Software 

Resolution range (A) 
R-factor b 

Rfree* 

Rms deviations in: 
Bond length (A) 
Bond angles (°) 



1.0 mM CaCl 2 
16-20% PEG8000 
0.05 M KH 2 P0 4 
P H 6.0 

20 mg protein /ml 
20 



P2,2 7 2 

44.9, 88.9, 32.2 
1 

Cu target (4.5 kW) 

175 

12,533 

0.069 

92.5 



Mol. replacement 
Baboon ot-LA 
X-PLOR 3.1 



X-PLOR 3.1 
8.0-2.0 
0.191 
0.278 

0.010 
1.554 



• = E„ ZiWr i) - (I(h))\/X fl S f /(fc, i), where l{h, f) is the 

intensity value of the ith measurement of h and (1(h)) is the cor- 
responding mean value of 1(h) for all i measurements. 

b R-factor=r||F^ s t-|F cak |l/|F ob ,l, where |F obs | and |F calc | 
are observed and calculated structure factor amplitude respec- 
tively. 

c R free is the same as R -factor, but for a 10% subset of all 
reflections. 



and Mol A, and 0.63 A between the recombinant 
molecule and Mol B. These values are larger than 
the root-mean-square deviation between Mol A 
and Mol B (0.27 A). From Figure 7, we can see that 
the intermolecular interactions remarkably affect 
the structure of the N-terminal and loop regions of 
the protein, especially between residues 105 and 
110, but that the overall structures of the recombi- 
nant and authentic proteins are essentially identi- 
cal, supporting previous observations of the same 
CD spectra of the proteins in solution. 

The structures around the N termini of the 
recombinant protein and the two molecules of the 
authentic protein are shown in Figure 8, and we 
may see structural differences that give rise to the 
differences in the electric properties and stability 
between them. The N-terminal amino group 
strongly interacts with the side-chain atoms of 
Thr38 and Gln39 in Mol A (Figure 8(a)) and Thr38 
in Mol B through hydrogen bonds and/or salt 
bridges (Figure 8(b)). A similar interaction can also 
be observed in the recombinant protein, in which 
the N-terminal amino group is bound to the side- 
chain of Gln39 by a hydrogen bond (Figure 8(c)), 
but this interaction may be significantly stronger 
than the corresponding interaction in the authentic 



Figure 6. The molecular packings of recombinant and 
authentic goat oc-lactalbumin in the crystals. The main- 
chain atoms of Mol A (orange) were superimposed on 
those of the recombinant protein molecule (blue). The 
same transformation matrixes were applied on Mol B 
(yellow). The Figure shows two of the symmetry-related 
recombinant protein molecules, and Mol A and B are 
overlaid. The space group was P2,2 t 2 in the recombi- 
nant protein crystal and P2 ? in the authentic protein 
crystal. The interactions in both crystals were very simi- 
lar. 



protein (see Discussion). It can also be seen from 
Figure 8(c) that the methionine side-chain of the 
recombinant protein is directly in contact with 
the side-chain of Gln2, and that the orientation of 
the methionine side-chain is fixed by the hydrogen 
bonds between the N-terminal amino group and 
the side-chain of Gln39, and between the main- 




60 80 
Residue number 

Figure 7. The structural differences between the corre- 
sponding C* atoms of the recombinant and authentic 
protein molecules. Differences were observed in the 
N-terminal residues and the flexible loop residues of 
105-110. The loop residues of the recombinant protein 
were affected by the neighboring molecules in the crys- 
tal. 
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Stabilities of recombinant and authentic 
a-lactalbumin 

Structure around the N terminus 
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Glul of the authentic protein is hydrogen-bonded 
with two side-chain oxygen atoms of Thr38 and 
Gln39 in Mol A and with a side-chain oxygen 
atom of Thr38 in Mol B. A similar hydrogen bond 
is also observed in the recombinant protein 
between the N-terminal amino group and Gln39, 
and the length of the hydrogen bond is smaller 
than that in the authentic protein, suggesting that 
the hydrogen bond is even stronger in the recombi- 
nant protein (Figure 8(c)). Although the hydrogen 
bond between the N-terminal amino group and 
Thr38 is missing in the recombinant protein, there 
is an alternative hydrogen bond between the main- 
chain amido group of Glul and the carboxyl group 
of Asp37. The degrees of the packing interactions 
of the side-chain atoms are also very similar in the 
N-terminal regions of the two proteins. The side- 
chains are closely packed in both proteins. Further- 
more, contributions of electrostatic interactions 
around the N termini to the destabilization of the 
recombinant protein will be shown to be negligibly 
small, although they are related to the difference in 
the electric net charge between the proteins (see 
below). Therefore, the observed destabilization 
cannot be interpreted in terms of the presumed 
interactions missing in the native structure of the 
recombinant protein. 

Conformational entropy of the methionine residue 
and solvation free energies 

If the destabilization of the recombinant protein 
cannot be simply explained by the interactions 
identified in the X-ray structures of the recombi- 
nant and authentic proteins, what makes the 
recombinant protein less stable? At this point, it 
should be noted that the N-terminal residues of 
both the recombinant and authentic proteins are 
involved in a rigid structure, so that all the atoms 
of the residues can be traced in the electron density 
maps of the proteins by X-ray crystallographic 
analysis. The B-factors of the backbone atoms of 
the N-terminal methionine residue of the recombi- 
nant protein were found to range from 31 to 35 A 2 . 
The values are much larger than those of the resi- 
dues buried inside the protein molecule (8-15 A 2 ), 
but are smaller than those of the fully exposed resi- 
dues in flexible loop regions. This means that the 
presence of the additional methionine residue in 
the recombinant protein destabilizes the native 
state through an entropic effect, which arises from 
an additional conformational entropy of the meth- 
ionine residue in the U state. Because the structure 
around the N terminus is rigid in the N state of the 
recombinant protein, the additional methionine 
residue leads to an increase in entropy on unfold- 
ing. Thus, the free energy change of unfolding 
(AGu), which is the difference in the free energy 
between the N and U states, decreases, and hence 
the N state of the recombinant protein is destabi- 
lized. 



estimated at 20 cal/(mol K) by Oobatake & n 
(1993) from an analysis of hydration and h 
stability effects on the unfolding of 14 gj oh f * 
proteins, and this corresponds with the free pi? 
change of -5.9 kcal/mol at 25 'C. This vab^ 
close to but lower than the observed diffo. e 1S 
(AAG r -3.5 kcal/mol) in AG. between Z 
recombinant and authentic proteins. We h 
however, ignored the contribution of the hydrati^' 
free energy, AGfJ, and the enthalpic contribution df 
the conformational unfolding, AH", which mostlv 
arises from the van der Waals interaction ener^v 
to the AAG U (Oobatake & Ooi, 1993). These contri 
butions are expected to be proportional to the 
change in the accessible surface area of the meth 
lonine residue on unfolding (Oobatake & Ooi 
1993) and may explain the above difference 
between the expected contribution of the confor 
mational entropy (-TAS") and the observed 
AAG U . The values of AG£ and AH? of the N-term- 
inal methionine residue were calculated by the 
method described by Oobatake & Ooi (1993) and 
they were -1.2 and 3.3 kcal/mol for AGJJ,' and 
AH?, respectively, so that the free energy change 
of unfolding of the methionine residue (AG U ) was 
estimated at -3.8 kcal/mol (see equation (8)), 
which was in good agreement with the observed 
A AGu (see Materials and Methods). The contri- 
bution of other residues to the AAG V was also esti- 
mated, and it was less than 1 kcal/mol (see 
Materials and Methods), confirming that the 
increase in the conformational entropy of the N- 
terminal methionine residue on unfolding is a 
dominant factor determining the AAG V . 

In the above argument of AAG V/ however, we 
have implicitly assumed that the U state is fully 
unfolded in both the recombinant and authentic 
proteins. Thus, if there is a difference in the U-state 
structure between the proteins, such a difference 
may also contribute to the AAG y . In fact, the m 
value of the equilibrium unfolding transition is 
found to be smaller for the recombinant protein 
(Table 1). Lower values of m are usually thought to 
be due to less exposure of hydrophobic surface on 
unfolding. Because the native structure is essen- 
tially identical between recombinant and authentic 
oc-lactalbumin, the less exposure of hydrophobic 
surface must be due to a difference in the U-state 
structure, and the U state of the recombinant pro- 
tein less exposes the hydrophobic surface than that 
of the authentic one. Similar effects of hydrophobic 
replacements of amino acid residues on the U-state 
structure have also been reported in staphylococcal 
nuclease (Shortle, 1996). The less difference in sol- 
vent exposed hydrophobic surface means a smaller 
difference in AG U . Therefore, this may also be a 
factor determining the AAG V between the recom- 
binant and authentic proteins. 

Comparison with other proteins 
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authentic bovine a-lactalbumin using thermal 
Lnaturation measurements of the proteins. They 
have shown that the destabilization of the recombi- 
nant protein is caused by an entropic effect because 
enthalpy change of the thermal unfolding is the 
same for the two proteins, and their result is fully 
consistent with our proposal regarding the destabi- 
lization of the recombinant protein described 
above. Although Ishikawa et al (1998) have attrib- 
uted the destabilization of the recombinant protein 
t0 a weakening of the apparent Ca 2+ -binding 
strength, this interpretation seems to be nothing 
more than a rephrasing of the destabilization of the 
protein because the apparent Ca 2+ -binding 
strength of a-lactalbumin is known to be linked to 
^eN^U equilibrium of the apo protein (Hiraoka 
& Sugai, 1985). Our X-ray structural data show 
that there is no essential difference in the structure 
of the Ca 2_f -binding site between the authentic and 
recombinant proteins, indicating that the weaken- 
ing of the apparent Ca 2+ ~binding strength of the 
recombinant protein is caused by a destabilization 
of its apo form. 

There have been several other reports of the 
effect of additional residues at the N terminus on 
the native-state stability of recombinant proteins, 
and a comparison of these with the present results 
will provide insight into a rule relating to the 
effects of an extra methionine residue in the pro- 
teins. Hargrove et al (1994) have observed that the 
recombinant apomyoglobin expressed in E. colt is 
less stable than the authentic protein. They have 
also shown that the N terminus of recombinant 
apomyoglobin contains an extra methionine resi- 
due and that the structure around the N terminus 
is rigid. Polyhistidine tags in the N and C-terminal 
regions of Arc repressor (Milla et al, 1993, 1995) 
have little effect on the stability and folding of the 
protein, whereas the polyhistidine tags of CspA 
alter the folding behavior by interacting with the 
wild-type portion of the protein (Reid et aL, 1998). 
The X-ray crystallographic structures of the Arc 
repressor (Raumann et al, 1994) and CspA 
(Goldstein et al, 1990) have shown that the struc- 
ture around the N-terminal residue in CspA is 
rigid, whereas that of Arc repressor is flexible. The 
N-terminal region of staphylococcal nuclease is 
flexible (Hynes & Fox, 1991), and it has been 
reported that a 19-residue pro-peptide in the 
N-terminal region of the nuclease does not signifi- 
cantly destabilize the N state of the recombinant 
protein (pro staphylococcal nuclease; Suciu & 
*nouye, 1996). Therefore, these studies together 
with the our study strongly suggest that when the 
structure around the N-terminal residue of a pro- 
tein is rigid, the addition of extra residues at the N 
terminus destabilizes the N state of the protein. On 
the other hand, when the structure is flexible, the 
extra residues do not interfere with the native-state 
stability. From these experimental results, we can 
*us conclude that when the N-terminal region of 



residue, destabilizes the N state, but that when the 
N-terminal region is flexible, expression of the pro- 
tein by E. coli does not interfere with the native- 
state stability. 

Electric properties of authentic and 
recombinant a-lactalbumin 

The results of the electrophoresis and ion- 
exchange chromatography show that the recombi- 
nant protein is more negatively charged than the 
authentic one. It is understood, however, that the 
side-chain of a methionine residue does not ionize 
at neutral pH, so that there is no difference in the 
number of ionizable groups between the authentic 
and recombinant proteins. In fact, our electrophor- 
esis and ion-exchange chromatography data show 
that there is no difference in the electric charge 
between the proteins in the presence of 8 M urea. 
This means that some of the ionizable groups that 
have a pK a near 7.0 experience a change in pK a 
due to the structural folding of the protein. There 
are two such ionizable groups, the imidazole 
group of histidine and the N-terminal amino 
group, which have intrinsic pK a values of 6.5 and 
8.0, respectively. If we compare the X-ray struc- 
tures of the two proteins, no significant differences 
are observed near the histidine side-chains. How- 
ever, there is a noticeable difference in the struc- 
tures around the N-terminal amino groups. The 
N-terminal amino group is hydrogen-bonded to 
the oxygen atom of Thr38 and is closer to the side- 
chains of Asp37 and Asp83 in the authentic protein 
(Figure 8), and both of these may increase the pX a 
value of the N-terminal amino group through 
electrostatic interactions. A study of the pH-depen- 
dence of the unfolding transition of authentic 
bovine a-lactalbumin has shown that the N-term- 
inal amino group of the protein has an abnormally 
high pK a value (pK a = 8.9) in the N state, which is 
normalized on unfolding from the N to the molten 
globule state (Kuwajima et al, 1981). 

It should also be mentioned that the ApK a of the 
N-terminal group between the recombinant and 
authentic proteins leads to a difference in the 
native-state stability between the proteins, but this 
stability difference is expected to be much smaller 
than the AAG V estimated from equation (6) at 
pH 7.0. The stability difference (AAGy(ApK a )) due 
to the ApK a is known to be given by: 

AAGu(ApKa) - RTln[(l - K a (rec)/[H + ])/ 

(1 - K a (auth)/[H + ])] (6) 

where K a (rec) and K a (auth) are the dissociation 
constants of the N-terminal amino groups of the 
recombinant and authentic proteins, respectively, 
and [H + ] is the hydrogen-ion concentration 
(Tanford, 1970). If we assume that the pX a (rec) and 
pK a (auth) are 8.0 and 8.9, respectively, the above 
equation gives a A AG u (ApK a ) of 0.06 kcal/mol at 
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N-terminal amino groufrreasonably interprets the 
differences in the electric properties between the 
proteins observed by electrophoresis and ion- 
exchange chromatography, but it is not sufficient 
for interpreting the stability difference between the 
proteins. 



Folding of goat a-lactalbumin 

Because the presence of the N-terminal methion- 
ine residue in the recombinant protein changes the 
thermodynamic stability of the native state, this 
system is useful for investigating the role of the N 
terminus in the folding of a-lactalbumin. We thus 
investigated the refolding and unfolding kinetics of 
the proteins by stopped-flow CD measurements. 
The results show that the rate of unfolding of the 
recombinant protein is faster than that of the auth- 
entic protein (Table 2), whereas the refolding rates 
are very similar in the two proteins (Table 3). This 
shows that the stability difference is caused by the 
enhanced unfolding rate of the recombinant 
protein, and this is interpreted in terms of the 
difference in the free energy of the unfolding tran- 
sition (AAG y ) and the difference in the activation 
free energy (AAGjj) of unfolding. The A A Go is 
known to be given by the ratio of the unfolding 
rate constants as: 



AAG^-KTln 



Mrec) 



Jc u (auth)J 



(7) 



where fc u (rec) and Uauth) represent the unfolding 
rate constants for the recombinant and authentic 
proteins, respectively. Because /c u (rec) is 57 times 
larger than * u (auth) at 5.4 M GdnHCl, AAG L is 
estimated to be 1.0 kcal/mol, and this value is 
nearly identical with the AAG V (0.8 kcal/mol) at 
the same concentration of the denaturant. Thus, 
the stability difference between the proteins can be 
fully interpreted in terms of the increase in the 
unfolding rate of the recombinant protein. This 
means that the structure around the mutation site, 
the N terminus in this case, has not yet been orga- 
nized in the transition state of refolding in a-lactal- 
bumin (Kuwajima et al, 1989; Matouschek et al, 
1989; Serrano et al, 1992). The folding initiation 
site of a-lactalbumin is thus not located in the 
N-terminal region of the protein. Previous studies 
have shown that the structure around the 6-120 
disulfide bond and that around the B helix, both of 
which are involved in the a-domain of this protein, 
have not yet been organized in the transition state 
of refolding (Ikeguchi et al, 1998; T. Y. et al, 
unpublished data), while the structure around the 
Ca* + -binding site is known to be already organized 
in the transition state (Kuwajima et al, 1989). Our 
results thus provide further support for the 
proposition that the folding initiation site of a-lac- 
talbumin is located at the interface between the 
a and p-domains, around the Ca 2+ -binding site of 
the protein. 



MaterialsTKd Methods 

Ch micals 

GdnHCl was of a specially prepared reagent grade fo r 
biochemical use from Nacalai Tesque, Inc. (Kyoto). The 
concentration of GdnHCl was determined from the 
refractive index at 589 nm with an Atago 3T refract- 
ometer (Pace, 1986). Cyanogen bromide (CNBr) Wa * 
purchased from Nacalai Tesque Inc. (Kyoto). Authentic 
goat a-lactalbumin was prepared from fresh goat milk 
by the method described (Kuwajima et a/., !9gQ) 
A Resource™-Q anion exchange column was purchased 
from Pharmacia Biotechnology, Inc. (Sweden) and a ^ 
BONDASPHERE 5 u C4 300 A reversed-phase column 
was supplied by Nihon Waters Ltd (Japan). 

Expression and purification of recombinant goat 
a-lactalbumin 

The expression system of goat a-lactalbumin and the 
procedures for the refolding and purification of the pro- 
tein have been reported by Kumagai et al (1990) and 
recently improved by Uchiyama et al. (1995) utilizing a 
T7 promoter (Studier & Moffatt, 1986). In brief, the pro- 
tein expressed in E. coli BL21(DE3) as inclusion bodies 
was solubilized in 8 M urea containing 20 mM Tris-HCl 
(pH 8.0) and first purified using a DEAE-Sepharose FF 
column. The eluted protein was reduced by 50 mM 
dithiothreitol and dialyzed against 20 mM Tris-HCl 
(pH 8.0) at 4°C to remove urea. Refolding of the reduced 
oc-lactalbumin was performed as described (Sawano et fl/., 
1992), with slight modifications, in a solution containing 
20% '(v/v) glycerol, 20 mM Tris-HCl (pH 8.0), 1 mM 
CaCl 2 , 6 mM glutathione, 0.6 mM oxidized glutathione, 
3.3 uM a-lactalbumin at 15 C for more than 20 hours. 
The refolding process was monitored by the appearance 
of a sharp peak on a reversed-phase HPLC chromato- 
oram detected by UV-absorbance at 215 nm using a C4 
column with a linear gradient elution of 28%-52% aceto- 
nitrile in the presence of 0.1 % (v/v) trifluoroacetic acid 
at a flow rate of 0.5 ml per minute. The refolded protein 
was then purified by DEAE-Sepharose FF and phenyl- 
Sepharose CL column chromatographies as described by 
Lindahl & Vogel (1984). Concentrations of authentic and 
recombinant goat a-lactalbumin were determined 
spectrophotometrically using an extinction coefficient ot 
EJ % = 20.1 for both (Kuwajima et aL, 1980). No free 
cysteinyl residues were detected in the folded recombi- 
nant protein by thiol content analysis (Ellman, 195V; 
Riddle et al, 1979). 

Preparation of methionine-free recombinant goat 
a-lactalbumin 

The methionine-free protein was prepared ac corc} in S 
to the method described by Kim et al. (1997) with slight 
modifications. Recombinant goat a-lactalbumin was dis 
solved in 70% (v/v) formic acid and treated m n 
100 mM CNBr (50-100-fold molar excess over the proteu 
concentration) for 24 hours in the dark at room tempera- 
ture. The cleaved product was diluted ten t,m _?. wl ^ 
water and dialyzed against 10 mM HO, then dialyzed 
against 10 mM Tris-HCl (pH 8.5) containing 1 rm 
CaCl 2 . Finally, the protein solution was punhed on 
Q-Sepharose FF column, which had been ^""ib™ 
with 20 mM Tris-HCl (pH S3) containing 1 mM 
and eluted with a linear gradient of NaCl trorn 0 M 
05 M. The mobilities and retention times of t.ie e 
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actions were checked by native PAGE and anion- 
change HPLC, and compared with those of the auth- 
^tic protein under the same conditions. The mass of the 
ethionine-cleaved protein was determined by mass 
trometric analysis, and the removal of the N-term- 
- : al methionine residue was confirmed by the N-term- 
al sequence analysis. The concentration of the CNBr- 
Jeaved protein was calculated using the same extinction 
Inefficient as that given above. 

lass spectrometric analysis 

Mass spectrometric analyses of the authentic, recom- 
binant and methionine-free proteins were carried 
Scut by the MALDI-TOF-MS mass spectroscopic method. 
:%inapinic acid mix protein samples were used as the 
Vlmatrix, and the spectra were taken in Reflex (Bruker). 

■ N-terminal sequence analysis 

N-terminal sequencing of recombinant, authentic, and 
. CNBr-cleaved proteins were carried out using an auto- 
mated Applied Biosystem sequencer model 477a 
"equipped with a model 120A on-line PTH amino acid 
analyzer. In this study we analyzed the first five residues 
in the proteins. 

Equilibrium CD measurements 

I Equilibrium CD spectra were taken on a Jasco J-720 
spectropolarimeter using an optical cuvette with a path 
length of 1.00 mm for measurements in the peptide 
region and 10.0 mm for measurements in the aromatic 
region. The CD spectra of the protein were measured in 
50 mM sodium cacodylate, 50 mM NaCl (pH 7.0) con- 
taining 1 mM CaCl 2 . The solutions for the GdnHCl- 
induced equilibrium unfolding studies were prepared in 
the same buffer containing various concentrations of 
GdnHCl. The mean residue ellipticity was calculated as 
a function of GdnHCl concentration at 25 °C by taking 
113 as the mean residue mass. The protein concentration 
in the equilibrium measurements was 0.15-0.2 mg/mL 
. The apparent fractional extent (F app ) of unfolding was 
calculated by: 



'obs ' 



app 



On — (k 



(8) 



where 0^ is the observed ellipticity, and 0 N and Q v are 
the ellipticities in the native (N) and the fully unfolded 
(U) states, respectively. The 0 N and d v values are 
assumed to linearly depend on the GdnHCl concen- 
tration (C) as 8 N = 0, + a, C and d v = 6 2 + a ? C. The N 
state baseline was calculated from the ellipticity values 
between 0.5 and 2 M and between 0.4 and 1.8 M 
GdnHCl, and the U state baseline was from the values 
between 4.5 and 6.2 M and between 3.8 and 6.2 M 
GdnHCl for the authentic and recombinant proteins, 
respectively. 

Kinetic measurements 

Refolding and unfolding reactions of the authentic 
and recombinant proteins were induced by GdnHCl con- 
centration jumps, which were performed by a s topped - 
fl ow CD apparatus (UNISOKU Inc., Japan) installed in 
the cell cornea rtment of the T-720 spectropolarimeter 



sodium cacodylate at pH 7.0 and 25 °C The dead time of 
the stopped-flow CD apparatus was 25 ms when a 
4 mm cuvette was used. The concentration of the protein 
stock solution was about 1.5-2.0 mg/mL The initial pro- 
tein solutions before the concentration jump contained 
1-0 M and 5.5 M GdnHCl for unfolding and refolding 
experiments, respectively. The diluent solution contained 
the same buffer (50 mM sodium cacodylate, 50 mM 
NaCl, and 1 mM CaCl 2 , pH 7.0) and an appropriate con- 
centration of GdnHCl. The two solutions were mixed 
with a mixing ratio of 1:10. 

X-ray crystallographic studies 

The crystal of recombinant goat a-lactalbumin was 
grown by the vapor diffusion method with a hanging 
drop in a chamber where the temperature was controlled 
at 20 °C The data were collected by an automated area 
detector system, DIP2000, on an X-ray generator with a 
bent mirror system at 9.5 °C Data processing and 
reduction was performed using DENZO and SCALE- 
PACK programs (Otwinowski, 1993). The crystallo- 
graphic data, the diffraction intensity statistics, and the 
refinement statistics are listed in Table 4. The crystal 
structure was solved on the basis of the model structure 
of baboon a-lactalbumin (Acharya et ah, 1989) by the 
molecular replacement method (Briinger, 1990) and was 
refined by a slow-cooling molecular-simulated annealing 
method in the X-PLOR 3.1 program suite (Brunger, 
1992). 

Theoretical estimation of AAG y between 
recombinant and authentic goat a-lactalbumin 

The AAGu value was calculated by the method 
described by Oobatake & Ooi (1993). In this calculation, 
every atom was identified as belonging to one of seven 
atomic groups: aliphatic C, aromatic C, hydroxyl O, 
amide N, carbonyl C, carbonyl O, and sulphur S. In 
addition, the accessible surface area (ASA) of each atom 
in the N state (except hydrogen) was calculated by the 
method described by Richmond (1984) using the coordi- 
nates of the X-ray crystal structures. Because the N-term- 
inal methionine residue is present only in the 
recombinant protein, the AAGy was assumed as a first 
approximation to be equal to the free energy change of 
unfolding (AG U ) of the methionine residue. For the ASA 
of atoms in the methionine residue in the U state, the 
values calculated by Shrake & Rupley (1973) were used. 
It was also assumed that the AG]!, and AH" are pro- 
portional to the change in the ASA (Aot, for the ;th atom- 
ic group) of the atoms on unfolding according to 
Oobatake & Ooi (1993). Thus: 

AG U =AG[J + AG" 

AG£ -Hfgi^Aoii 

AG C U =AH C U - TAS C U 

AH" =£A. c Atx; (9) 

where g iM and h ic are proportionality constants for the 
seven atomic groups. Although the change in the confor- 
mational entropy, AS", was also assumed to be pro- 
portional to the Aot, values in the original Oobatake & 
Ooi (1993) method, this assumption may not be correct 
for the extra methionine residue of recombinant goat 
ot-lactalbumin due to the rigid nature of this residue as 



the distance between the O atom of the methyl side- 
chain of MetO and the C & atom of Gln2 side-chain being 
3.5 A. Moreover, the N-terminal amino group in the 
recombinant protein is hydrogen-bonded with the carbo- 
nyl oxygen atom of the Gln39 side-chain. Thus the 
-7AS C U value (-5-9 kcal/mol) obtained from Table 8 of 
Oobatake & Ooi (1993) was employed (see Discussion). 

The contribution of other residues to the A AG y value 
was also estimated by (g iM + g,- c )Ao£^ where g ic is a 
proportionality constant and Aot, is the difference in the 
ASA value of the ith atomic group between the authentic 
and recombinant proteins in the N state. Here, -TAS" 
was assumed to be proportional to the change in the 
ASA values following the original Oobatake & Ooi 
(1993) method. The values obtained for Mol A and Mol 
B of the crystal structure of the authentic protein were 
averaged. Since Glul of the authentic protein is more 
exposed to solvent in the unfolded state, the difference 
in the ASA of the atoms of Glul between the authentic 
and recombinant proteins in the unfolded state (Aa^) 
was also taken into account for the estimation of AAGy 
as £, ig iM +gi,c) A <*¥ using the ASA values of Shrake & 
Rupley (1973). The contribution of the other residues to 
the AAG V thus estimated has been found to be less than 
1 kcal/mol. 



Protein Data Bank accession number 

The coordinates have been deposited in the 
Brookhaven Protein Data Bank with accession number 
Ihmk. 
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